Table 1: Human GLG FR3 sequences
! VH1
! 66  67  68  69  70  71  72  73  74  75  76  77  78  79  80
  agg gtc acc atg acc agg gac acg tcc atc agc aca gcc tac atg
! 81  82  82a 82b 82c 83  84  85  86  87  88  89  90  91  92
  gag ctg agc agg ctg aga tct gac gac acg gcc gtg tat tac tgt
! 93  94  95
  gcg aga ga ! 1-02# 1

  aga gtc acc att acc agg gac aca tcc gcg agc aca gcc tac atg
  gag ctg agc agc ctg aga tct gaa gac acg gct gtg tat tac tgt
  gcg aga ga ! 1-03# 2

  aga gtc acc atg acc agg aac acc tcc ata agc aca gcc tac atg
  gag ctg agc agc ctg aga tct gag gac acg gcc gtg tat tac tgt
  gcg aga gg ! 1-08# 3

  aga gtc acc atg acc aca gac aca tcc acg agc aca gcc tac atg
  gag ctg agg agc ctg aga tct gac gac acg gcc gtg tat tac tgt
  gcg aga ga ! 1-18# 4

  aga gtc acc atg acc gag gac aca tct aca gac aca gcc tac atg
  gag ctg agc agc ctg aga tct gag gac acg gcc gtg tat tac tgt
  gca aca ga ! 1-24# 5

  aga gtc acc att acc agg gac agg tct atg agc aca gcc tac atg
  gag ctg agc agc ctg aga tct gag gac aca gcc atg tat tac tgt
  gca aga ta ! 1-45# 6

  aga gtc acc atg acc agg gac acg tcc acg agc aca gtc tac atg
  gag ctg agc agc ctg aga tct gag gac acg gcc gtg tat tac tgt
  gcg aga ga ! 1-46# 7

  aga gtc acc att acc agg gac atg tcc aca agc aca gcc tac atg
  gag ctg agc agc ctg aga tcc gag gac acg gcc gtg tat tac tgt
  gcg gca ga ! 1-58# 8

  aga gtc acg att acc gcg gac gaa tcc acg agc aca gcc tac atg
  gag ctg agc agc ctg aga tct gag gac acg gcc gtg tat tac tgt
  gcg aga ga ! 1-69# 9

  aga gtc acg att acc gcg gac aaa tcc acg agc aca gcc tac atg
  gag ctg agc agc ctg aga tct gag gac acg gcc gtg tat tac tgt
  gcg aga ga ! 1-e# 10

  aga gtc acc ata acc gcg gac acg tct aca gac aca gcc tac atg
  gag ctg agc agc ctg aga tct gag gac acg gcc gtg tat tac tgt
  gca aca ga ! 1-f# 11
! VH2



  agg ctc acc atc acc aag gac acc tcc aaa aac cag gtg gtc ctt
  aca atg acc aac atg gac cct gtg gac aca gcc aca tat tac tgt
  gca cac aga c ! 2-05# 12

  agg ctc acc atc tcc aag gac acc tcc aaa agc cag gtg gtc ctt
  acc atg acc aac atg gac cct gtg gac aca gcc aca tat tac tgt
  gca cgg ata c ! 2-26# 13

  agg ctc acc atc tcc aag gac acc tcc aaa aac cag gtg gtc ctt
  aca atg acc aac atg gac cct gtg gac aca gcc acg tat tac tgt
  gca cgg ata c ! 2-70# 14

! VH3
  cga ttc acc atc tcc aga gac aac gcc aag aac tca ctg tat ctg
  caa atg aac agc ctg aga gcc gag gac acg gct gtg tat tac tgt
  gcg aga ga ! 3-07# 15

  cga ttc acc atc tcc aga gac aac gcc aag aac tcc ctg tat ctg
  caa atg aac agt ctg aga gct gag gac acg gcc ttg tat tac tgt
  gca aaa gat a ! 3-09# 16

  cga ttc acc atc tcc agg gac aac gcc aag aac tca ctg tat ctg
  caa atg aac agc ctg aga gcc gag gac acg gcc gtg tat tac tgt
  gcg aga ga ! 3-11# 17

  cga ttc acc atc tcc aga gaa aat gcc aag aac tcc ttg tat ctt
  caa atg aac agc ctg aga gcc ggg gac acg gct gtg tat tac tgt
  gca aga ga ! 3-13# 18

  aga ttc acc atc tca aga gat gat tca aaa aac acg ctg tat ctg
  caa atg aac agc ctg aaa acc gag gac aca gcc gtg tat tac tgt
  acc aca ga ! 3-15# 19

  cga ttc acc atc tcc aga gac aac gcc aag aac tcc ctg tat ctg
  caa atg aac agt ctg aga gcc gag gac acg gcc ttg tat cac tgt
  gcg aga ga ! 3-20# 20

  cga ttc acc atc tcc aga gac aac gcc aag aac tca ctg tat ctg
  caa atg aac agc ctg aga gcc gag gac acg gct gtg tat tac tgt
  gcg aga ga ! 3-21# 21

  cgg ttc acc atc tcc aga gac aat tcc aag aac acg ctg tat ctg
  caa atg aac agc ctg aga gcc gag gac acg gcc gta tat tac tgt
  gcg aaa ga ! 3-23# 22

  cga ttc acc atc tcc aga gac aat tcc aag aac acg ctg tat ctg
  caa atg aac agc ctg aga gct gag gac acg gct gtg tat tac tgt
  gcg aaa ga ! 3-30# 23

  cga ttc acc atc tcc aga gac aat tcc aag aac acg ctg tat ctg
  caa atg aac agc ctg aga gct gag gac acg gct gtg tat tac tgt
  gcg aga ga ! 3303# 24

  cga ttc acc atc tcc aga gac aat tcc aag aac acg ctg tat ctg
  caa atg aac agc ctg aga gct gag gac acg gct gtg tat tac tgt
  gcg aaa ga ! 3305# 25

  cga ttc acc atc tcc aga gac aat tcc aag aac acg ctg tat ctg
  caa atg aac agc ctg aga gcc gag gac acg gct gtg tat tac tgt
  gcg aga ga ! 3-33# 26

  cga ttc acc atc tcc aga gac aac agc aaa aac tcc ctg tat ctg
  caa atg aac agt ctg aga act gag gac acc gcc ttg tat tac tgt
  gca aaa gat a ! 3-43# 27

  cga ttc acc atc tcc aga gac aat gcc aag aac tca ctg tat ctg
  caa atg aac agc ctg aga gac gag gac acg gct gtg tat tac tgt
  gcg aga ga ! 3-48# 28

  aga ttc acc atc tca aga gat ggt tcc aaa agc atc gcc tat ctg
  caa atg aac agc ctg aaa acc gag gac aca gcc gtg tat tac tgt
  act aga ga ! 3-49# 29

  cga ttc acc atc tcc aga gac aat tcc aag aac acg ctg tat ctt
  caa atg aac agc ctg aga gcc gag gac acg gcc gtg tat tac tgt
  gcg aga ga ! 3-53# 30

  aga ttc acc atc tcc aga gac aat tcc aag aac acg ctg tat ctt
  caa atg ggc agc ctg aga gct gag gac atg gct gtg tat tac tgt
  gcg aga ga ! 3-64# 31

  aga ttc acc atc tcc aga gac aat tcc aag aac acg ctg tat ctt
  caa atg aac agc ctg aga gct gag gac acg gct gtg tat tac tgt
  gcg aga ga ! 3-66# 32

  aga ttc acc atc tca aga gat gat tca aag aac tca ctg tat ctg
  caa atg aac agc ctg aaa acc gag gac acg gcc gtg tat tac tgt
  gct aga ga ! 3-72# 33

  agg ttc acc atc tcc aga gat gat tca aag aac acg gcg tat ctg
  caa atg aac agc ctg aaa acc gag gac acg gcc gtg tat tac tgt
  act aga ca ! 3-73# 34

  cga ttc acc atc tcc aga gac aac gcc aag aac acg ctg tat ctg
  caa atg aac agt ctg aga gcc gag gac acg gct gtg tat tac tgt
  gca aga ga ! 3-74# 35

  aga ttc acc atc tcc aga gac aat tcc aag aac acg ctg cat ctt
  caa atg aac agc ctg aga gct gag gac acg gct gtg tat tac tgt
  aag aaa ga ! 3-d# 36

! VH4
  cga gtc acc ata tca gta gac aag tcc aag aac cag ttc tcc ctg
  aag ctg agc tct gtg acc gcc gcg gac acg gcc gtg tat tac tgt
  gcg aga ga ! 4-04# 37



  cga gtc acc atg tca gta gac acg tcc aag aac cag ttc tcc ctg
  aag ctg agc tct gtg acc gcc gtg gac acg gcc gtg tat tac tgt
  gcg aga aa ! 4-28# 38

  cga gtt acc ata tca gta gac acg tct aag aac cag ttc tcc ctg
  aag ctg agc tct gtg act gcc gcg gac acg gcc gtg tat tac tgt
  gcg aga ga ! 4301# 39

  cga gtc acc ata tca gta gac agg tcc aag aac cag ttc tcc ctg
  aag ctg agc tct gtg acc gcc gcg gac acg gcc gtg tat tac tgt
  gcc aga ga ! 4302# 40

  cga gtt acc ata tca gta gac acg tcc aag aac cag ttc tcc ctg
  aag ctg agc tct gtg act gcc gca gac acg gcc gtg tat tac tgt
  gcc aga ga ! 4304# 41

  cga gtt acc ata tca gta gac acg tct aag aac cag ttc tcc ctg
  aag ctg agc tct gtg act gcc gcg gac acg gcc gtg tat tac tgt
  gcg aga ga ! 4-31# 42

  cga gtc acc ata tca gta gac acg tcc aag aac cag ttc tcc ctg
  aag ctg agc tct gtg acc gcc gcg gac acg gct gtg tat tac tgt
  gcg aga ga ! 4-34# 43

  cga gtc acc ata tcc gta gac acg tcc aag aac cag ttc tcc ctg
  aag ctg agc tct gtg acc gcc gca gac acg gct gtg tat tac tgt
  gcg aga ca ! 4-39# 44

  cga gtc acc ata tca gta gac acg tcc aag aac cag ttc tcc ctg
  aag ctg agc tct gtg acc gct gcg gac acg gcc gtg tat tac tgt
  gcg aga ga ! 4-59# 45

  cga gtc acc ata tca gta gac acg tcc aag aac cag ttc tcc ctg
  aag ctg agc tct gtg acc gct gcg gac acg gcc gtg tat tac tgt
  gcg aga ga ! 4-61# 46

  cga gtc acc ata tca gta gac acg tcc aag aac cag ttc tcc ctg
  aag ctg agc tct gtg acc gcc gca gac acg gcc gtg tat tac tgt
  gcg aga ga ! 4-b# 47

! VH5
  cag gtc acc atc tca gcc gac aag tcc atc agc acc gcc tac ctg
  cag tgg agc agc ctg aag gcc tcg gac acc gcc atg tat tac tgt
  gcg aga ca ! 5-51# 48

  cac gtc acc atc tca gct gac aag tcc atc agc act gcc tac ctg
  cag tgg agc agc ctg aag gcc tcg gac acc gcc atg tat tac tgt
  gcg aga ! 5-a# 49

! VH6
  cga ata acc atc aac cca gac aca tcc aag aac cag ttc tcc ctg
  cag ctg aac tct gtg act ccc gag gac acg gct gtg tat tac tgt
  gca aga ga ! 6-1# 50

! VH7
  cgg ttt gtc ttc tcc ttg gac acc tct gtc agc acg gca tat ctg
  cag atc tgc agc cta aag gct gag gac act gcc gtg tat tac tgt
  gcg aga ga ! 74.1# 51
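The base numbers used in the tables that follow can be related back to the Kabat codon labels in the header rows of Table 1. The sketch below assumes that base# 1 is the first base of codon 66 (this is inferred from the listed hits, for example BstEII at base# 3 of GLG 1, and is not stated explicitly in the table); the function names are illustrative.

```python
# Kabat labels for the FR3 codons in Table 1: 66..82, 82a, 82b, 82c, 83..95.
KABAT_FR3 = (
    [str(n) for n in range(66, 83)]
    + ["82a", "82b", "82c"]
    + [str(n) for n in range(83, 96)]
)


def codon_for_base(base_number: int) -> str:
    """Return the Kabat codon label that contains a 1-based FR3 base number.

    Assumption: base# 1 is the first base of codon 66.
    """
    if base_number < 1:
        raise ValueError("base numbers are 1-based")
    return KABAT_FR3[(base_number - 1) // 3]


def bases_for_codon(label: str) -> range:
    """Return the 1-based base numbers covered by a Kabat codon label."""
    index = KABAT_FR3.index(label)
    return range(3 * index + 1, 3 * index + 4)


if __name__ == "__main__":
    print(codon_for_base(3))             # last base of codon 66
    print(codon_for_base(86))            # falls inside codon 91
    print(list(bases_for_codon("82a")))  # bases covered by codon 82a
```

Under that assumption, a site reported at base# 86 starts in the second base of codon 91 (`tac`).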

Table 2: Enzymes that either cut 15 or more human GLGs or have 5+-base recognition in FR3
Typical entry:

REname Recognition #sites
GLGid#: base# GLGid#: base# GLGid#: base#

BstEII Ggtnacc 2
1: 3 48: 3
There are 2 hits at base# 3
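An entry of this kind can be reproduced by scanning a GLG sequence for a recognition pattern written with IUPAC ambiguity codes. The following is a minimal sketch, not the procedure used to build the table: it searches only the strand as written, reports 1-based start positions, and the function name `find_sites` is hypothetical. The example sequence is GLG 1 (1-02) from Table 1.

```python
import re

# IUPAC nucleotide ambiguity codes mapped to the bases they stand for.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "w": "at", "s": "cg", "m": "ac", "k": "gt",
    "b": "cgt", "d": "agt", "h": "act", "v": "acg", "n": "acgt",
}


def find_sites(sequence: str, recognition: str) -> list[int]:
    """Return 1-based start positions of a recognition pattern.

    Whitespace in the sequence is ignored and case is not significant.
    A lookahead is used so that overlapping sites are all reported.
    """
    seq = re.sub(r"\s+", "", sequence).lower()
    pattern = "".join(f"[{IUPAC[ch]}]" for ch in recognition.lower())
    return [m.start() + 1 for m in re.finditer(f"(?=({pattern}))", seq)]


GLG_1_02 = (
    "agg gtc acc atg acc agg gac acg tcc atc agc aca gcc tac atg "
    "gag ctg agc agg ctg aga tct gac gac acg gcc gtg tat tac tgt "
    "gcg aga ga"
)

if __name__ == "__main__":
    for name, site in [("BstEII", "Ggtnacc"), ("MaeIII", "gtnac"),
                       ("HphI", "tcacc"), ("DdeI", "Ctnag"),
                       ("Bst4CI", "ACNgt")]:
        print(name, find_sites(GLG_1_02, site))
```

For GLG 1 this gives BstEII at base 3, MaeIII at base 4, HphI at base 5, DdeI at bases 49 and 58, and Bst4CI at base 86, which agrees with the corresponding entries in Table 2.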



MaeIII gtnac 36
1: 4 2: 4 3: 4 4: 4 5: 4 6: 4
7: 4 8: 4 9: 4 10: 4 11: 4 37: 4
37: 58 38: 4 38: 58 39: 4 39: 58 40: 4
40: 58 41: 4 41: 58 42: 4 42: 58 43: 4
43: 58 44: 4 44: 58 45: 4 45: 58 46: 4
46: 58 47: 4 47: 58 48: 4 49: 4 50: 58
There are 24 hits at base# 4

Tsp45I gtsac 33
1: 4 2: 4 3: 4 4: 4 5: 4 6: 4
7: 4 8: 4 9: 4 10: 4 11: 4 37: 4
37: 58 38: 4 38: 58 39: 58 40: 4 40: 58
41: 58 42: 58 43: 4 43: 58 44: 4 44: 58
45: 4 45: 58 46: 4 46: 58 47: 4 47: 58
48: 4 49: 4 50: 58
There are 21 hits at base# 4

HphI tcacc 45
1: 5 2: 5 3: 5 4: 5 5: 5 6: 5
7: 5 8: 5 11: 5 12: 5 12: 11 13: 5
14: 5 15: 5 16: 5 17: 5 18: 5 19: 5
20: 5 21: 5 22: 5 23: 5 24: 5 25: 5
26: 5 27: 5 28: 5 29: 5 30: 5 31: 5
32: 5 33: 5 34: 5 35: 5 36: 5 37: 5
38: 5 40: 5 43: 5 44: 5 45: 5 46: 5
47: 5 48: 5 49: 5

There are 44 hits at base# 5

NlaIII CATG 26
1: 9 1: 42 2: 42 3: 9 3: 42 4: 9
4: 42 5: 9 5: 42 6: 42 6: 78 7: 9
7: 42 8: 21 8: 42 9: 42 10: 42 11: 42
12: 57 13: 48 13: 57 14: 57 31: 72 38: 9
48: 78 49: 78
There are 11 hits at base# 42
There are 1 hits at base# 48 Could cause raggedness.
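The "There are N hits at base# B" summary lines are simple tallies over the (GLGid#, base#) pairs of an entry. As a small sketch (the helper names are illustrative, and the data are the NlaIII pairs listed above), the per-position counts and the number of distinct GLGs cut can be derived as follows.

```python
from collections import Counter

# (GLG id#, base#) pairs for NlaIII (CATG) as listed in Table 2.
NLAIII_HITS = [
    (1, 9), (1, 42), (2, 42), (3, 9), (3, 42), (4, 9),
    (4, 42), (5, 9), (5, 42), (6, 42), (6, 78), (7, 9),
    (7, 42), (8, 21), (8, 42), (9, 42), (10, 42), (11, 42),
    (12, 57), (13, 48), (13, 57), (14, 57), (31, 72), (38, 9),
    (48, 78), (49, 78),
]


def hits_per_base(hits):
    """Count how many hits fall at each base number."""
    return Counter(base for _glg, base in hits)


def glgs_cut(hits):
    """Return the sorted GLG ids that contain at least one site."""
    return sorted({glg for glg, _base in hits})


if __name__ == "__main__":
    counts = hits_per_base(NLAIII_HITS)
    print(len(NLAIII_HITS), "sites in total")
    print(counts[42], "hits at base# 42")
    print(counts[48], "hits at base# 48")
    print(len(glgs_cut(NLAIII_HITS)), "distinct GLGs are cut")
```

The total number of sites (second field of the entry header) counts every pair, so a GLG with two sites contributes twice, whereas the "cut 15 or more human GLGs" criterion in the table title refers to distinct GLGs.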



BsaJI Ccnngg 37
1: 14 2: 14 5: 14 6: 14 7: 14 8: 14
8: 65 9: 14 10: 14 11: 14 12: 14 13: 14
14: 14 15: 65 17: 14 17: 65 18: 65 19: 65
20: 65 21: 65 22: 65 26: 65 29: 65 30: 65
33: 65 34: 65 35: 65 37: 65 38: 65 39: 65
40: 65 42: 65 43: 65 48: 65 49: 65 50: 65
51: 14
There are 23 hits at base# 65
There are 14 hits at base# 14

AluI AGct 42
1: 47 2: 47 3: 47 4: 47 5: 47 6: 47
7: 47 8: 47 9: 47 10: 47 11: 47 16: 63
23: 63 24: 63 25: 63 31: 63 32: 63 36: 63
37: 47 37: 52 38: 47 38: 52 39: 47 39: 52
40: 47 40: 52 41: 47 41: 52 42: 47 42: 52
43: 47 43: 52 44: 47 44: 52 45: 47 45: 52
46: 47 46: 52 47: 47 47: 52 49: 15 50: 47
There are 23 hits at base# 47
There are 11 hits at base# 52 Only 5 bases from 47

BlpI GCtnagc 21
1: 48 2: 48 3: 48 5: 48 6: 48 7: 48
8: 48 9: 48 10: 48 11: 48 37: 48 38: 48
39: 48 40: 48 41: 48 42: 48 43: 48 44: 48
45: 48 46: 48 47: 48
There are 21 hits at base# 48

MwoI GCNNNNNnngc 19
1: 48 2: 28 19: 36 22: 36 23: 36 24: 36
25: 36 26: 36 35: 36 37: 67 39: 67 40: 67
41: 67 42: 67 43: 67 44: 67 45: 67 46: 67
47: 67
There are 10 hits at base# 67
There are 7 hits at base# 36

DdeI Ctnag 71
1: 49 1: 58 2: 49 2: 58 3: 49 3: 58
3: 65 4: 49 4: 58 5: 49 5: 58 5: 65
6: 49 6: 58 6: 65 7: 49 7: 58 7: 65
8: 49 8: 58 9: 49 9: 58 9: 65 10: 49
10: 58 10: 65 11: 49 11: 58 11: 65 15: 58
16: 58 16: 65 17: 58 18: 58 20: 58 21: 58
22: 58 23: 58 23: 65 24: 58 24: 65 25: 58
25: 65 26: 58 27: 58 27: 65 28: 58 30: 58
31: 58 31: 65 32: 58 32: 65 35: 58 36: 58
36: 65 37: 49 38: 49 39: 26 39: 49 40: 49
41: 49 42: 26 42: 49 43: 49 44: 49 45: 49
46: 49 47: 49 48: 12 49: 12 51: 65
There are 29 hits at base# 58
There are 22 hits at base# 49 Only nine bases from 58
There are 16 hits at base# 65 Only seven bases from 58

BglII Agatct 11

1: 61 2: 61 3: 61 4: 61 5: 61 6: 61 
7: 61 9: 61 10: 61 11: 61 51: 47 
There are 10 hits at base# 61 



BstYI Rgatcy 12 

1: 61 2: 61 3: 61 4: 61 5: 61 6: 61 
7: 61 8: 61 9: 61 10: 61 11: 61 51: 47 
There are 11 hits at base# 61

Hpy188I TCNga 17

1: 64 2: 64 3: 64 4: 64 5: 64 6: 64 
7: 64 8: 64 9: 64 10: 64 11: 64 16: 57 

20: 57 27: 57 35: 57 48: 67 49: 67 

There are 11 hits at base# 64 

There are 4 hits at base# 57 

There are 2 hits at base# 67 Could be ragged. 



MslI CAYNNnnRTG 44
1: 72 2: 72 3: 72 4: 72 5: 72 6: 72
7: 72 8: 72 9: 72 10: 72 11: 72 15: 72
17: 72 18: 72 19: 72 21: 72 23: 72 24: 72
25: 72 26: 72 28: 72 29: 72 30: 72 31: 72
32: 72 33: 72 34: 72 35: 72 36: 72 37: 72
38: 72 39: 72 40: 72 41: 72 42: 72 43: 72
44: 72 45: 72 46: 72 47: 72 48: 72 49: 72
50: 72 51: 72
There are 44 hits at base# 72

BsiEI CGRYcg 23 

1: 74 3: 74 4: 74 5: 74 7: 74 8: 74 

9: 74 10: 74 11: 74 17: 74 22: 74 30: 74 

33: 74 34: 74 37: 74 38: 74 39: 74 40: 74 

41: 74 42: 74 45: 74 46: 74 47: 74 
There are 23 hits at base# 74 

EaeI Yggccr 23

1: 74 3: 74 4: 74 5: 74 7: 74 8: 74 

9: 74 10: 74 11: 74 17: 74 22: 74 30: 74 

33: 74 34: 74 37: 74 38: 74 39: 74 40: 74 

41: 74 42: 74 45: 74 46: 74 47: 74 
There are 23 hits at base# 74 



EagI Cggccg 23
1: 74 3: 74 4: 74 5: 74 7: 74 8: 74
9: 74 10: 74 11: 74 17: 74 22: 74 30: 74
33: 74 34: 74 37: 74 38: 74 39: 74 40: 74
41: 74 42: 74 45: 74 46: 74 47: 74
There are 23 hits at base# 74

HaeIII GGcc 27
1: 75 3: 75 4: 75 5: 75 7: 75 8: 75
9: 75 10: 75 11: 75 16: 75 17: 75 20: 75
22: 75 30: 75 33: 75 34: 75 37: 75 38: 75
39: 75 40: 75 41: 75 42: 75 45: 75 46: 75
47: 75 48: 63 49: 63
There are 25 hits at base# 75

Bst4CI ACNgt 65°C 63 sites There is a third isoschizomer
1: 86 2: 86 3: 86 4: 86 5: 86 6: 86
7: 34 7: 86 8: 86 9: 86 10: 86 11: 86
12: 86 13: 86 14: 86 15: 36 15: 86 16: 53
16: 86 17: 36 17: 86 18: 86 19: 86 20: 53
20: 86 21: 36 21: 86 22: 0 22: 86 23: 86
24: 86 25: 86 26: 86 27: 53 27: 86 28: 36
28: 86 29: 86 30: 86 31: 86 32: 86 33: 36
33: 86 34: 86 35: 53 35: 86 36: 86 37: 86
38: 86 39: 86 40: 86 41: 86 42: 86 43: 86
44: 86 45: 86 46: 86 47: 86 48: 86 49: 86
50: 86 51: 0 51: 86
There are 51 hits at base# 86 All the other sites are well away

HpyCH4III ACNgt 63
1: 86 2: 86 3: 86 4: 86 5: 86 6: 86
7: 34 7: 86 8: 86 9: 86 10: 86 11: 86
12: 86 13: 86 14: 86 15: 36 15: 86 16: 53
16: 86 17: 36 17: 86 18: 86 19: 86 20: 53
20: 86 21: 36 21: 86 22: 0 22: 86 23: 86
24: 86 25: 86 26: 86 27: 53 27: 86 28: 36
28: 86 29: 86 30: 86 31: 86 32: 86 33: 36
33: 86 34: 86 35: 53 35: 86 36: 86 37: 86
38: 86 39: 86 40: 86 41: 86 42: 86 43: 86
44: 86 45: 86 46: 86 47: 86 48: 86 49: 86
50: 86 51: 0 51: 86
There are 51 hits at base# 86

HinfI Gantc 43
2: 2 3: 2 4: 2 5: 2 6: 2 7: 2
8: 2 9: 2 9: 22 10: 2 11: 2 15: 2
16: 2 17: 2 18: 2 19: 2 19: 22 20: 2
21: 2 23: 2 24: 2 25: 2 26: 2 27: 2
28: 2 29: 2 30: 2 31: 2 32: 2 33: 2
33: 22 34: 22 35: 2 36: 2 37: 2 38: 2
40: 2 43: 2 44: 2 45: 2 46: 2 47: 2
50: 60
There are 38 hits at base# 2

MlyI GAGTCNNNNNn 18
2: 2 3: 2 4: 2 5: 2 6: 2 7: 2
8: 2 9: 2 10: 2 11: 2 37: 2 38: 2
40: 2 43: 2 44: 2 45: 2 46: 2 47: 2
There are 18 hits at base# 2

PleI gagtc 18
2: 2 3: 2 4: 2 5: 2 6: 2 7: 2
8: 2 9: 2 10: 2 11: 2 37: 2 38: 2
40: 2 43: 2 44: 2 45: 2 46: 2 47: 2
There are 18 hits at base# 2

AciI Ccgc 24
2: 26 9: 14 10: 14 11: 14 27: 74 37: 62
37: 65 38: 62 39: 65 40: 62 40: 65 41: 65
42: 65 43: 62 43: 65 44: 62 44: 65 45: 62
46: 62 47: 62 47: 65 48: 35 48: 74 49: 74
There are 8 hits at base# 62
There are 8 hits at base# 65
There are 3 hits at base# 14
There are 3 hits at base# 74
There are 1 hits at base# 26
There are 1 hits at base# 35

-"- Gcgg 11
8: 91 9: 16 10: 16 11: 16 37: 67 39: 67
40: 67 42: 67 43: 67 45: 67 46: 67
There are 7 hits at base# 67
There are 3 hits at base# 16
There are 1 hits at base# 91

BsiHKAI GWGCWc 20
2: 30 4: 30 6: 30 7: 30 9: 30 10: 30
12: 89 13: 89 14: 89 37: 51 38: 51 39: 51
40: 51 41: 51 42: 51 43: 51 44: 51 45: 51
46: 51 47: 51
There are 11 hits at base# 51

Bsp1286I GDGCHc 20
2: 30 4: 30 6: 30 7: 30 9: 30 10: 30
12: 89 13: 89 14: 89 37: 51 38: 51 39: 51
40: 51 41: 51 42: 51 43: 51 44: 51 45: 51
46: 51 47: 51
There are 11 hits at base# 51

HgiAI GWGCWc 20
2: 30 4: 30 6: 30 7: 30 9: 30 10: 30
12: 89 13: 89 14: 89 37: 51 38: 51 39: 51
40: 51 41: 51 42: 51 43: 51 44: 51 45: 51
46: 51 47: 51
There are 11 hits at base# 51

BsoFI GCngc 26
2: 53 3: 53 5: 53 6: 53 7: 53 8: 53
8: 91 9: 53 10: 53 11: 53 31: 53 36: 36
37: 64 39: 64 40: 64 41: 64 42: 64 43: 64
44: 64 45: 64 46: 64 47: 64 48: 53 49: 53
50: 45 51: 53
There are 13 hits at base# 53
There are 10 hits at base# 64

TseI Gcwgc 17
2: 53 3: 53 5: 53 6: 53 7: 53 8: 53
9: 53 10: 53 11: 53 31: 53 36: 36 45: 64
46: 64 48: 53 49: 53 50: 45 51: 53
There are 13 hits at base# 53

MnlI gagg 34
3: 67 3: 95 4: 51 5: 16 5: 67 6: 67
7: 67 8: 67 9: 67 10: 67 11: 67 15: 67
16: 67 17: 67 19: 67 20: 67 21: 67 22: 67
23: 67 24: 67 25: 67 26: 67 27: 67 28: 67
29: 67 30: 67 31: 67 32: 67 33: 67 34: 67
35: 67 36: 67 50: 67 51: 67
There are 31 hits at base# 67

HpyCH4V TGca 34
5: 90 6: 90 11: 90 12: 90 13: 90 14: 90
15: 44 16: 44 16: 90 17: 44 18: 90 19: 44
20: 44 21: 44 22: 44 23: 44 24: 44 25: 44
26: 44 27: 44 27: 90 28: 44 29: 44 33: 44
34: 44 35: 44 35: 90 36: 38 48: 44 49: 44
50: 44 50: 90 51: 44 51: 52
There are 21 hits at base# 44
There are 1 hits at base# 52

AccI GTmkac 13 5-base recognition
7: 37 11: 24 37: 16 38: 16 39: 16 40: 16
41: 16 42: 16 43: 16 44: 16 45: 16 46: 16
47: 16
There are 11 hits at base# 16

SacII CCGCgg 8 6-base recognition
9: 14 10: 14 11: 14 37: 65 39: 65 40: 65
42: 65 43: 65
There are 5 hits at base# 65
There are 3 hits at base# 14

TfiI Gawtc 24
9: 22 15: 2 16: 2 17: 2 18: 2 19: 2
19: 22 20: 2 21: 2 23: 2 24: 2 25: 2
26: 2 27: 2 28: 2 29: 2 30: 2 31: 2
32: 2 33: 2 33: 22 34: 22 35: 2 36: 2
There are 20 hits at base# 2

BsmAI Nnnnnngagac 19
15: 11 16: 11 20: 11 21: 11 22: 11 23: 11
24: 11 25: 11 26: 11 27: 11 28: 11 28: 56
30: 11 31: 11 32: 11 35: 11 36: 11 44: 87
48: 87
There are 16 hits at base# 11

BpmI ctccag 19
15: 12 16: 12 17: 12 18: 12 20: 12 21: 12
22: 12 23: 12 24: 12 25: 12 26: 12 27: 12
28: 12 30: 12 31: 12 32: 12 34: 12 35: 12
36: 12
There are 19 hits at base# 12

XmnI GAANNnnttc 12
37: 30 38: 30 39: 30 40: 30 41: 30 42: 30
43: 30 44: 30 45: 30 46: 30 47: 30 50: 30
There are 12 hits at base# 30

BsrI NCcagt 12
37: 32 38: 32 39: 32 40: 32 41: 32 42: 32
43: 32 44: 32 45: 32 46: 32 47: 32 50: 32
There are 12 hits at base# 32

BanII GRGCYc 11
37: 51 38: 51 39: 51 40: 51 41: 51 42: 51
43: 51 44: 51 45: 51 46: 51 47: 51
There are 11 hits at base# 51

Ecl136I GAGctc 11
37: 51 38: 51 39: 51 40: 51 41: 51 42: 51
43: 51 44: 51 45: 51 46: 51 47: 51
There are 11 hits at base# 51

SacI GAGCTc 11
37: 51 38: 51 39: 51 40: 51 41: 51 42: 51
43: 51 44: 51 45: 51 46: 51 47: 51
There are 11 hits at base# 51
Table 3: Synthetic 3-23 FR3 of human heavy chains showing positions of possible cleavage sites

! Sites engineered into the synthetic gene are shown in upper case DNA
! with the RE name between vertical bars (as in | XbaI |).
! RERSs frequently found in GLGs are shown below the synthetic sequence
! with the name to the right (as in gtn ac = MaeIII(24), indicating that
! 24 of the 51 GLGs contain the site).

!          |---FR3---
!            89  90      (codon # in synthetic 3-23)
!             R   F
           |cgc|ttc|  6
! allowed  |cgn|tty|
!          |agr|   |
!   ga ntc = HinfI(38)
!   ga gtc = PleI(18)
!   ga wtc = TfiI(20)
!   gtn ac = MaeIII(24)
!   gts ac = Tsp45I(21)
!   tc acc = HphI(44)

!          --------FR3--------
!            91  92  93  94  95  96  97  98  99  100 101 102 103 104 105
!             T   I   S   R   D   N   S   K   N   T   L   Y   L   Q   M
!                  | XbaI |
           |act|atc|TCT|AGA|gac|aac|tct|aag|aat|act|ctc|tac|ttg|cag|atg|  51
! allowed  |acn|ath|tcn|cgn|gay|aay|tcn|aar|aay|acn|ttr|tay|ttr|car|atg|
!          |   |   |agy|agr|   |   |agy|   |   |   |ctn|   |ctn|   |   |
!   ga|gac = BsmAI(16)
!   ag ct = AluI(23)
!   g ctn agc = BlpI(21)
!   c|tcc ag = BpmI(19)
!   g aan nnn ttc = XmnI(12)
!   tg ca = HpyCH4V(21)

!          --------FR3------->|
!            106 107 108 109 110 111 112 113 114 115 116 117 118 119 120
!             N   S   L   R   A   E   D   T   A   V   Y   Y   C   A   K
!              | AflII |               | PstI |
           |aac|agC|TTA|AGg|gct|gag|gac|aCT|GCA|Gtc|tac|tat|tgc|gct|aaa|  96
! allowed  |aay|tcn|ttr|cgn|gcn|gar|gay|acn|gcn|gtn|tay|tay|tgy|gcn|aar|
!          |   |agy|ctn|agr|   |   |   |   |   |   |   |   |   |   |   |
!   cc nng g = BsaJI(23)
!   aga tct = BglII(10)
!   Rga tcY = BstYI(11)
!   ac ngt = Bst4CI(51)
!   ac ngt = HpyCH4III(51)
!   ac ngt = TaaI(51)
!   c ayn nnn rtg = MslI(44)
!   cg ryc g = BsiEI(23)
!   yg gcc r = EaeI(23)
!   cg gcc g = EagI(23)
!   g gcc = HaeIII(25)
!   gag g = MnlI(31)
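The synthetic FR3 above can be checked for internal consistency: the DNA row should translate to the amino-acid row, and the upper-case engineered sites should be present. The sketch below uses the standard genetic code; the helper names and the choice to report 1-based positions counted from codon 89 are assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, built in the conventional t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Synthetic 3-23 FR3, codons 89 to 120, as given in Table 3.
SYNTHETIC_FR3 = (
    "cgc ttc act atc TCT AGA gac aac tct aag aat act ctc tac ttg cag atg "
    "aac agC TTA AGg gct gag gac aCT GCA Gtc tac tat tgc gct aaa"
)

# Recognition sequences of the engineered sites named in the table.
ENGINEERED = {"XbaI": "tctaga", "AflII": "cttaag", "PstI": "ctgcag"}


def clean(dna: str) -> str:
    """Drop whitespace and fold to lower case."""
    return "".join(dna.split()).lower()


def translate(dna: str) -> str:
    """Translate complete codons; a trailing partial codon is ignored."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def engineered_positions(dna: str) -> dict:
    """Return the 1-based position of each engineered site (0 if absent)."""
    seq = clean(dna)
    return {name: seq.find(site) + 1 for name, site in ENGINEERED.items()}


if __name__ == "__main__":
    print(translate(SYNTHETIC_FR3))
    print(engineered_positions(SYNTHETIC_FR3))
```

The translation reproduces the amino-acid rows (RF, TISRDNSKNTLYLQM, NSLRAEDTAVYYCAK), and the running base counts 6, 51 and 96 printed at the right of the DNA rows match the cumulative codon lengths.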
Table 4: REdaptors, Extenders, and Bridges used for Cleavage and Capture of Human Heavy Chains in FR3.

A: HpyCH4V Probes of actual human HC genes

! HpyCH4V in FR3 of human HC, bases 35-56; only those with TGca site
! TGca;10,
! RE recognition: tgca of length 4 is expected at 10

1 6-1
agttctccctgcagctgaactc

2 3-11,3-07,3-21,3-72,3-48
cactgtatctgcaaatgaacag

3 3-09,3-43,3-20
ccctgtatctgcaaatgaacag

4 5-51
ccgcctacctgcagtggagcag

5 3-15,3-30,3-30.5,3-30.3,3-74,3-23,3-33
cgctgtatctgcaaatgaacag

6 7-4.1
cggcatatctgcagatctgcag

7 3-73
cggcgtatctgcaaatgaacag

8 5-a
ctgcctacctgcagtggagcag

9 3-49
tcgcctatctgcaaatgaacag

B: HpyCH4V REdaptors, Extenders, and Bridges
B.1 REdaptors

! Cutting HC lower strand:
! TmKeller for 100 mM NaCl, zero formamide
! REdaptors for cleavage                         TmW   TmK
(ON_HCFR36-1)   5'-agttctcccTGCAgctgaactc-3'   68.0  64.5
(ON_HCFR36-1A)  5'-ttctcccTGCAgctgaactc-3'     62.0  62.5
(ON_HCFR36-1B)  5'-ttctcccTGCAgctgaac-3'       56.0  59.9
(ON_HCFR33-15)  5'-cgctgtatcTGCAaatgaacag-3'   64.0  60.8
(ON_HCFR33-15A) 5'-ctgtatcTGCAaatgaacag-3'     56.0  56.3
(ON_HCFR33-15B) 5'-ctgtatcTGCAaatgaac-3'       50.0  53.1
(ON_HCFR33-11)  5'-cactgtatcTGCAaatgaacag-3'   62.0  58.9
(ON_HCFR35-51)  5'-ccgcctaccTGCAgtggagcag-3'   74.0  70.1
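The first melting-temperature column above is consistent with the simple 2 °C per A/T plus 4 °C per G/C counting rule (often called the Wallace rule). Reading "TmW" this way is an assumption, since the source does not define the column; the second column (TmKeller, 100 mM NaCl, zero formamide) depends on a formula not given here and is not reproduced. A minimal sketch:

```python
def wallace_tm(oligo: str) -> int:
    """Estimate Tm as 2*(A+T) + 4*(G+C) degrees C.

    Characters other than a, c, g, t (spaces, primes, hyphens) are ignored.
    """
    bases = [ch for ch in oligo.lower() if ch in "acgt"]
    gc = sum(ch in "gc" for ch in bases)
    at = len(bases) - gc
    return 2 * at + 4 * gc


# REdaptors from section B.1 with the first Tm column as listed.
REDAPTORS = {
    "ON_HCFR36-1": ("agttctcccTGCAgctgaactc", 68),
    "ON_HCFR36-1A": ("ttctcccTGCAgctgaactc", 62),
    "ON_HCFR36-1B": ("ttctcccTGCAgctgaac", 56),
    "ON_HCFR33-15": ("cgctgtatcTGCAaatgaacag", 64),
    "ON_HCFR33-15A": ("ctgtatcTGCAaatgaacag", 56),
    "ON_HCFR33-15B": ("ctgtatcTGCAaatgaac", 50),
    "ON_HCFR33-11": ("cactgtatcTGCAaatgaacag", 62),
    "ON_HCFR35-51": ("ccgcctaccTGCAgtggagcag", 74),
}

if __name__ == "__main__":
    for name, (seq, listed) in REDAPTORS.items():
        print(f"{name:15s} computed {wallace_tm(seq):3d}  listed {listed}")
```

The shortened variants (suffixes A and B) lose two bases at a time from one end, which is why their values step down in increments that depend only on the bases removed.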



B.2 Segment of synthetic 3-23 gene into which captured CDR3 is to be cloned

!                    XbaI...
! D323* cgCttcacTaag tcT aga gac aaC tcT aag aaT acT ctc taC
! scab designed gene 3-23 gene
!
!       HpyCH4V
!                          AflII...
!       Ttg caG atg aac agc TtA agG ...

B.3 Extender and Bridges

! Extender (bottom strand):
(ON_HCHpyEx01) 5'-cAAgTAgAgAgTATTcTTAgAgTTgTcTcTAgAcTTAgTgAAgcg-3'
! ON_HCHpyEx01 is the reverse complement of
! 5'-cgCttcacTaag tcT aga gac aaC tcT aag aaT acT ctC taC Ttg-3'

! Bridges (top strand, 9-base overlap):
(ON_HCHpyBr016-1) 5'-cgCttcacTaag tcT aga gac aaC tcT aag aaT acT ctC taC Ttg CAgctgaac-3' {3'-term C is blocked}

! 3-15 et al. + 3-11
(ON_HCHpyBr023-15) 5'-cgCttcacTaag tcT aga gac aaC tcT aag aaT acT ctC taC Ttg CAaatgaac-3' {3'-term C is blocked}

! 5-51
(ON_HCHpyBr045-51) 5'-cgCttcacTaag tcT aga gac aaC tcT aag aaT acT ctC taC Ttg CAgtggagc-3' {3'-term C is blocked}

! PCR primer (top strand)
(ON_HCHpyPCR) 5'-cgCttcacTaag tcT aga gac-3'
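Each HpyCH4V bridge listed in B.3 is the synthetic 3-23 segment followed by nine bases taken from the germline probe, starting at the CA of the TGCA site. The sketch below rebuilds that 9-base overlap from the probes in section A. It assumes the cut falls between TG and CA (inferred from where the bridge sequences switch to germline bases), and the function names are illustrative.

```python
# Synthetic 3-23 segment that precedes the germline overlap in each bridge.
SEGMENT = "cgCttcacTaag tcT aga gac aaC tcT aag aaT acT ctC taC Ttg"


def overlap_after_cut(probe: str, site: str = "tgca",
                      cut_offset: int = 2, overlap: int = 9) -> str:
    """Return the bases that follow the cut in the first occurrence of site.

    cut_offset=2 places the cut between TG and CA (an assumption).
    """
    seq = probe.lower()
    start = seq.find(site)
    if start < 0:
        raise ValueError("recognition site not found in probe")
    tail = seq[start + cut_offset:start + cut_offset + overlap]
    if len(tail) < overlap:
        raise ValueError("probe too short for the requested overlap")
    return tail


def bridge(probe: str) -> str:
    """Join the synthetic segment to the germline overlap (lower case)."""
    return SEGMENT.replace(" ", "").lower() + overlap_after_cut(probe)


if __name__ == "__main__":
    probes = {
        "6-1": "agttctccctgcagctgaactc",
        "3-15": "cgctgtatctgcaaatgaacag",
        "5-51": "ccgcctacctgcagtggagcag",
    }
    for glg, probe in probes.items():
        print(glg, overlap_after_cut(probe), bridge(probe))
```

For the 6-1, 3-15 and 5-51 probes the overlaps come out as `cagctgaac`, `caaatgaac` and `cagtggagc`, matching the 3' ends of ON_HCHpyBr016-1, ON_HCHpyBr023-15 and ON_HCHpyBr045-51.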

C: BlpI Probes from human HC GLGs 

1 1-58, 1-03, 1-08, 1-69, 1-24, 1-45, 1-46, 1-f, 1-e
acatggaGCTGAGCagcctgag 

2 1-02 
acatggaGCTGAGCaggctgag 

3 1-18
acatggagctgaggagcctgag

4 5-51, 5-a
acctgcagtggagcagcctgaa 

5 3-15,3-73,3-49,3-72 
atctgcaaatgaacagcctgaa 

6 3303,3-33,3-07,3-11,3-30,3-21,3-23,3305,3-48 
atctgcaaatgaacagcctgag 

7 3-20,3-74,3-09,3-43 
atctgcaaatgaacagtctgag 

8 74.1 
atctgcagatctgcagcctaaa 

9 3-66, 3-13, 3-53, 3-d 
atcttcaaatgaacagcctgag 

10 3-64 
atcttcaaatgggcagcctgag 

11 4301, 4-28, 4302, 4-04, 4304, 4-31, 4-34, 4-39, 4-59, 4-61, 4-b 
ccctgaaGCTGAGCtctgtgac 

12 6-1 
ccctgcagctgaactctgtgac 

13 2-70,2-05 
tccttacaatgaccaacatgga 

14 2-26
tccttaccatgaccaacatgga 

D: BlpI REdaptors, Extenders, and Bridges
D.1 REdaptors
!                                                    TmW  TmK
(BlpF3HC1-58) 5'-ac atg gaG CTG AGC agc ctg ag-3'   70   66.4
(BlpF3HC6-1)  5'-cc ctg aag ctg agc tct gtg ac-3'   70   66.4
! BlpF3HC6-1 matches 4-30.1, not 6-1.

D.2 Segment of synthetic 3-23 gene into which captured CDR3 is to be cloned

!                                                                BlpI
!                    XbaI...
! D323* cgCttcacTaag TCT AGA gac aaC tcT aag aaT acT ctc taC Ttg caG atg aac
!          AflII...
!       agC TTA AGg

D.3 Extender and Bridges

! Bridges
(BlpF3Br1) 5'-cgCttcacTcag tcT aga gaT aaC AGT aaA aaT acT TtG taC Ttg caG Ctg a|GC agc ctg-3'
(BlpF3Br2) 5'-cgCttcacTcag tcT aga gaT aaC AGT aaA aaT acT TtG taC Ttg caG Ctg a|gc tct gtg-3'
! | lower strand is cut here

! Extender
(BlpF3Ext) 5'-TcAgcTgcAAgTAcAAAgTATTTTTAcTgTTATcTcTAgAcTgAgTgAAgcg-3'
! BlpF3Ext is the reverse complement of:
! 5'-cgCttcacTcag tcT aga gaT aaC AGT aaA aaT acT TtG taC Ttg caG Ctg a-3'

(BlpF3PCR) 5'-cgCttcacTcag tcT aga gaT aaC-3'
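The extenders are described as reverse complements of the corresponding top-strand segments, which is easy to verify. The sketch below is a plain reverse-complement helper; the constant names are illustrative. The top-strand sequences are those quoted for BlpF3Ext in D.3 and, for comparison, for ON_HCHpyEx01 in B.3.

```python
# Complement table that keeps the upper/lower case of each base.
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement, ignoring any whitespace."""
    return "".join(seq.split()).translate(COMPLEMENT)[::-1]


# Top strand quoted under BlpF3Ext (section D.3).
TOP_BLP = ("cgCttcacTcag tcT aga gaT aaC AGT aaA aaT acT TtG "
           "taC Ttg caG Ctg a")

# Top strand and extender quoted for ON_HCHpyEx01 (section B.3).
TOP_HPY = "cgCttcacTaag tcT aga gac aaC tcT aag aaT acT ctC taC Ttg"
ON_HCHPYEX01 = "cAAgTAgAgAgTATTcTTAgAgTTgTcTcTAgAcTTAgTgAAgcg"

if __name__ == "__main__":
    print(reverse_complement(TOP_BLP).lower())
    print(reverse_complement(TOP_HPY).lower() == ON_HCHPYEX01.lower())
```

Case is compared after lower-casing because the listed oligonucleotides use capitals for emphasis differently from the top-strand notation.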



E: HpyCH4III Distinct GLG sequences surrounding site, bases 77-98

1 102#1,118#4,146#7,169#9,1e#10,311#17,353#30,404#37,4301
ccgtgtattactgtgcgagaga

2 103#2,307#15,321#21,3303#24,333#26,348#28,364#31,366#32
ctgtgtattactgtgcgagaga

3 108#3
ccgtgtattactgtgcgagagg

4 124#5,1f#11
ccgtgtattactgtgcaacaga

5 145#6
ccatgtattactgtgcaagata

6 158#8
ccgtgtattactgtgcggcaga

7 205#12
ccacatattactgtgcacacag

8 226#13
ccacatattactgtgcacggat

9 270#14
ccacgtattactgtgcacggat

10 309#16,343#27
ccttgtattactgtgcaaaaga

11 313#18,374#35,61#50
ctgtgtattactgtgcaagaga

12 315#19
ccgtgtattactgtaccacaga

13 320#20
ccttgtatcactgtgcgagaga

14 323#22
ccgtatattactgtgcgaaaga

15 330#23,3305#25
ctgtgtattactgtgcgaaaga

16 349#29
ccgtgtattactgtactagaga

17 372#33
ccgtgtattactgtgctagaga

18 373#34
ccgtgtattactgtactagaca

19 3d#36
ctgtgtattactgtaagaaaga

20 428#38
ccgtgtattactgtgcgagaaa

21 4302#40,4304#41
ccgtgtattactgtgccagaga

22 439#44
ctgtgtattactgtgcgagaca

23 551#48
ccatgtattactgtgcgagaca

24 5a#49
ccatgtattactgtgcgaga
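
Every sequence in this list carries the HpyCH4III site at the same place. As a rough sketch, assuming the recognition sequence ACNGT and assuming the first listed base is base 77, the site can be located and expressed in the base numbering used here. `site_bases` is an illustrative name.

```python
import re

FIRST_BASE = 77  # the listed 22-mers are described as starting at base 77

def site_bases(seq: str):
    """Return (first, last) base numbers of every ACNGT site in seq."""
    s = seq.upper()
    # A lookahead is used so that overlapping sites would also be reported.
    return [(FIRST_BASE + m.start(), FIRST_BASE + m.start() + 4)
            for m in re.finditer(r"(?=AC[ACGT]GT)", s)]

examples = {
    "102#1": "ccgtgtattactgtgcgagaga",
    "205#12": "ccacatattactgtgcacacag",
    "320#20": "ccttgtatcactgtgcgagaga",
}
for name, seq in examples.items():
    print(name, site_bases(seq))
```

With the site spanning bases 86 to 90, a cut after the N on the top strand and at the symmetric position on the lower strand places the lower-strand cut just before base 88.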



F: HpyCH4III REdaptors, Extenders, and Bridges
F.1 REdaptors

! ONs for cleavage of HC (lower) in FR3 (bases 77-97)
! For cleavage with HpyCH4III, Bst4CI, or TaaI
! cleavage is in lower chain before base 88.
!                      77 788 888 888 889 999 999 9
!                      78 901 234 567 890 123 456 7      TmW  TmK
(H43.77.97.1-02#1)  5'-cc gtg tat tAC TGT gcg aga g-3'   64   62.6
(H43.77.97.1-03#2)  5'-ct gtg tat tAC TGT gcg aga g-3'   62   60.6
(H43.77.97.108#3)   5'-cc gtg tat tAC TGT gcg aga g-3'   64   62.6
(H43.77.97.323#22)  5'-cc gta tat tac tgt gcg aaa g-3'   60   58.7
(H43.77.97.330#23)  5'-ct gtg tat tac tgt gcg aaa g-3'   60   58.7
(H43.77.97.439#44)  5'-ct gtg tat tac tgt gcg aga c-3'   62   60.6
(H43.77.97.551#48)  5'-cc atg tat tac tgt gcg aga c-3'   62   60.6
(H43.77.97.5a#49)   5'-cc atg tat tAC TGT gcg aga-3'     58   58.2
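
The first of the two numbers listed after each REdaptor agrees with the simple Wallace rule, Tm = 2(A+T) + 4(G+C). This is an observation from the listed values rather than something the text states, and the formula behind the second number is not identified here. A minimal sketch, with `wallace_tm` as an illustrative helper:

```python
def wallace_tm(seq: str) -> int:
    """Wallace-rule melting temperature in degrees C: 2*(A+T) + 4*(G+C)."""
    s = seq.replace(" ", "").upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2 * at + 4 * gc

# REdaptor sequences from F.1, with the first listed temperature for comparison.
redaptors = {
    "H43.77.97.1-02#1": ("cc gtg tat tAC TGT gcg aga g", 64),
    "H43.77.97.1-03#2": ("ct gtg tat tAC TGT gcg aga g", 62),
    "H43.77.97.323#22": ("cc gta tat tac tgt gcg aaa g", 60),
    "H43.77.97.5a#49": ("cc atg tat tAC TGT gcg aga", 58),
}

for name, (seq, listed) in redaptors.items():
    print(name, wallace_tm(seq), listed)
```

The same rule gives 70 for both 22-base BlpI REdaptors in D.1.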

F.2 Extender and Bridges

! XbaI and AflII sites in bridges are bunged

(H43.XABr1) 5'-ggtgtagtga-
|TCT|AGt|gac|aac|tct|aag|aat|act|ctc|tac|ttg|cag|atg|-
|aac|a gC|TTt|AGg|gct|gag|gac|aCT|GCA|Gtc|tac|tat tgt gcg aga-3'

(H43.XABr2) 5'-ggtgtagtga-
|TCT|AGt|gac|aac|tct|aag|aat|act|ctc|tac|ttg|cag|atg|-
|aac|a gC|TTt|AGg|gct|gag|gac|aCT|GCA|Gtc|tac|tat tgt gcg aaa-3'

(H43.XAExt) 5'-ATAgTAgAcT gcAgTgTccT cAgcccTTAA gcTgTTcATc TgcAAgTAgA-
gAgTATTcTT AgAgTTgTcT cTAgATcAcT AcAcc-3'
!H43.XAExt is the reverse complement of
! 5'-ggtgtagtga-
! |TCT|AGA|gac|aac|tct|aag|aat|act|ctc|tac|ttg|cag|atg|-
! |aac|a gC|TTA|AGg|gct|gag|gac|aCT|GCA|Gtc|tac|tat-3'

(H43.XAPCR) 5'-ggtgtagtga|TCT|AGA|gac|aac-3'

! XbaI and AflII sites in bridges are bunged
(H43.ABr1) 5'-ggtgtagtga-
|aac|a gC|TTt|AGg|gct|gag|gac|aCT|GCA|Gtc|tac|tat tgt gcg aga-3'
(H43.ABr2) 5'-ggtgtagtga-
|aac|a gC|TTt|AGg|gct|gag|gac|aCT|GCA|Gtc|tac|tat tgt gcg aaa-3'
(H43.AExt) 5'-ATAgTAgAcTgcAgTgTccTcAgcccTTAAgcTgTTTcAcTAcAcc-3'
!(H43.AExt) is the reverse complement of 5'-ggtgtagtga-
! |aac|a gC|TTA|AGg|gct|gag|gac|aCT|GCA|Gtc|tac|tat-3'
(H43.APCR) 5'-ggtgtagtga|aac|a gC|TTA|AGg|gct|g-3'
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Group 337 471 363 218 130 58 23 11 6 

Cumulative 337 808 1171 1389 1519 1577 1600 1611 1617 

Seqs with the expected RE site only 1511 

Seqs with only an unexpected site 0 

Seqs with both expected and unexpected 8 

Seqs with no sites 0 



Table 5D: 

Analysis repeated using only 8 best REdaptors 
Id Ntot 0 1 2 3 4 5 6 7 8+ 

1 301 78 101 54 32 16 9 10 1 0 281 102#1 
ccgtgtattactgtgcgagaga 

2 493 69 155 125 73 37 14 11 3 6 459 103#2 
ctgtgtattactgtgcgagaga 

3 189 52 45 38 23 18 5 4 1 3 176 108#3 
ccgtgtattactgtgcgagagg 

4 127 29 23 28 24 10 6 5 2 0 114 323#22 
ccgtatattactgtgcgaaaga 

5 78 21 25 14 11 1 4 2 0 0 72 330#23 
ctgtgtattactgtgcgaaaga

6 79 15 17 25 8 11 1 439#44
ctgtgtattactgtgcgagaca



7 43 14 15 5 5 3 0 1
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0 42 551#48 
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63 72 51 38 24 14 
Seqs with the expected RE site only 1463 / 1617

Seqs with only an unexpected site 0

Seqs with both expected and unexpected

Seqs with no sites



Table 6: Human HC GLG FR1 Sequences

VH Exon - Nucleotide sequence alignment

VH1
1-02 CAG GTG CAG CTG GTG CAG TCT GGG GCT GAG GTG AAG AAG CCT GGG GCC TCA GTG AAG
     GTC TCC TGC AAG GCT TCT GGA TAC ACC TTC ACC
1-03 cag gtC cag ctT gtg cag tct ggg gct gag gtg aag aag cct ggg gcc tca gtg aag
     gtT tcc tgc aag gct tct gga tac acc ttc acT
1-08 cag gtg cag ctg gtg cag tct ggg gct gag gtg aag aag cct ggg gcc tca gtg aag
     gtc tcc tgc aag gct tct gga tac acc ttc acc
1-18 cag gtT cag ctg gtg cag tct ggA gct gag gtg aag aag cct ggg gcc tca gtg aag
     gtc tcc tgc aag gct tct ggT tac acc ttT acc
1-24 cag gtC cag ctg gtA cag tct ggg gct gag gtg aag aag cct ggg gcc tca gtg aag
     gtc tcc tgc aag gTt tcC gga tac acc Ctc acT
1-45 cag Atg cag ctg gtg cag tct ggg gct gag gtg aag aag Act ggg Tcc tca gtg aag
     gtT tcc tgc aag gct tcC gga tac acc ttc acc
1-46 cag gtg cag ctg gtg cag tct ggg gct gag gtg aag aag cct ggg gcc tca gtg aag
     gtT tcc tgc aag gcA tct gga tac acc ttc acc
1-58 caA Atg cag ctg gtg cag tct ggg Cct gag gtg aag aag cct ggg Acc tca gtg aag
     gtc tcc tgc aag gct tct gga tTc acc ttT acT
1-69 cag gtg cag ctg gtg cag tct ggg gct gag gtg aag aag cct ggg Tcc tcG gtg aag
     gtc tcc tgc aag gct tct gga GGc acc ttc aGc
1-e  cag gtg cag ctg gtg cag tct ggg gct gag gtg aag aag cct ggg Tcc tcG gtg aag
     gtc tcc tgc aag gct tct gga GGc acc ttc aGc
1-f  Gag gtC cag ctg gtA cag tct ggg gct gag gtg aag aag cct ggg gcT Aca gtg aaA
     Atc tcc tgc aag gTt tct gga tac acc ttc acc

VH2
2-05 CAG ATC ACC TTG AAG GAG TCT GGT CCT ACG CTG GTG AAA CCC ACA CAG ACC CTC ACG
     CTG ACC TGC ACC TTC TCT GGG TTC TCA CTC AGC
2-26 cag Gtc acc ttg aag gag tct ggt cct GTg ctg gtg aaa ccc aca Gag acc ctc acg
     ctg acc tgc acc Gtc tct ggg ttc tca ctc agc
2-70 cag Gtc acc ttg aag gag tct ggt cct Gcg ctg gtg aaa ccc aca cag acc ctc acA
     ctg acc tgc acc ttc tct ggg ttc tca ctc agc

VH3
3-07 GAG GTG CAG CTG GTG GAG TCT GGG GGA GGC TTG GTC CAG CCT GGG GGG TCC CTG AGA
     CTC TCC TGT GCA GCC TCT GGA TTC ACC TTT AGT
3-09 gaA gtg cag ctg gtg gag tct ggg gga ggc ttg gtA cag cct ggC Agg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttt GAt
3-11 Cag gtg cag ctg gtg gag tct ggg gga ggc ttg gtc Aag cct ggA ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttc agt
3-13 gag gtg cag ctg gtg gag tct ggg gga ggc ttg gtA cag cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttc agt
3-15 gag gtg cag ctg gtg gag tct ggg gga ggc ttg gtA Aag cct ggg ggg tcc ctT aga
     ctc tcc tgt gca gcc tct gga ttc acT ttc agt
3-20 gag gtg cag ctg gtg gag tct ggg gga ggT Gtg gtA cGg cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttt GAt
3-21 gag gtg cag ctg gtg gag tct ggg gga ggc Ctg gtc Aag cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttc agt
3-23 gag gtg cag ctg Ttg gag tct ggg gga ggc ttg gtA cag cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttt agC
3-30 Cag gtg cag ctg gtg gag tct ggg gga ggc Gtg gtc cag cct ggg Agg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttc agt
3-30.3 Cag gtg cag ctg gtg gag tct ggg gga ggc Gtg gtc cag cct ggg Agg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttc agt
3-30.5 Cag gtg cag ctg gtg gag tct ggg gga ggc Gtg gtc cag cct ggg Agg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttC agt
3-33 Cag gtg cag ctg gtg gag tct ggg gga ggc Gtg gtc cag cct ggg Agg tcc ctg aga
     ctc tcc tgt gca gcG tct gga ttc acc ttC agt
3-43 gaA gtg cag ctg gtg gag tct ggg gga gTc Gtg gtA cag cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttt GAt
3-48 gag gtg cag ctg gtg gag tct ggg gga ggc ttg gtA cag cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttC agt
3-49 gag gtg cag ctg gtg gag tct ggg gga ggc ttg gtA cag ccA ggg Cgg tcc ctg aga
     ctc tcc tgt Aca gcT tct gga ttc acc ttt Ggt
3-53 gag gtg cag ctg gtg gag Act ggA gga ggc ttg Atc cag cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct ggG ttc acc Gtc agt
3-64 gag gtg cag ctg gtg gag tct ggg gga ggc ttg gtc cag cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttc agt
3-66 gag gtg cag ctg gtg gag tct ggg gga ggc ttg gtc cag cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc GtC agt
3-72 gag gtg cag ctg gtg gag tct ggg gga ggc ttg gtc cag cct ggA ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttC agt
3-73 gag gtg cag ctg gtg gag tct ggg gga ggc ttg gtc cag cct ggg ggg tcc ctg aAa
     ctc tcc tgt gca gcc tct ggG ttc acc ttc agt
3-74 gag gtg cag ctg gtg gag tcC ggg gga ggc ttA gtT cag cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc ttc agt
3-d  gag gtg cag ctg gtg gag tct Cgg gga gTc ttg gtA cag cct ggg ggg tcc ctg aga
     ctc tcc tgt gca gcc tct gga ttc acc GtC agt

VH4
4-04 CAG GTG CAG CTG CAG GAG TCG GGC CCA GGA CTG GTG AAG CCT TCG GGG ACC CTG TCC
     CTC ACC TGC GCT GTC TCT GGT GGC TCC ATC AGC
4-28 cag gtg cag ctg cag gag tcg ggc cca gga ctg gtg aag cct tcg gAC acc ctg tcc
     ctc acc tgc gct gtc tct ggt TAc tcc atc agc
4-30.1 cag gtg cag ctg cag gag tcg ggc cca gga ctg gtg aag cct tcA CAg acc ctg tcc
     ctc acc tgc Act gtc tct ggt ggc tcc atc agc
4-30.2 cag Ctg cag ctg cag gag tcC ggc Tca gga ctg gtg aag cct tcA CAg acc ctg tcc
     ctc acc tgc gct gtc tct ggt ggc tcc atc agc
4-30.4 cag gtg cag ctg cag gag tcg ggc cca gga ctg gtg aag cct tcA CAg acc ctg tcc
     ctc acc tgc Act gtc tct ggt ggc tcc atc agc
4-31 cag gtg cag ctg cag gag tcg ggc cca gga ctg gtg aag cct tcA CAg acc ctg tcc
     ctc acc tgc Act gtc tct ggt ggc tcc atc agc
4-34 cag gtg cag ctA cag Cag tGg ggc Gca gga ctg Ttg aag cct tcg gAg acc ctg tcc
     ctc acc tgc gct gtc tAt ggt ggG tcc Ttc agT
4-39 cag Ctg cag ctg cag gag tcg ggc cca gga ctg gtg aag cct tcg gAg acc ctg tcc
     ctc acc tgc Act gtc tct ggt ggc tcc atc agc
4-59 cag gtg cag ctg cag gag tcg ggc cca gga ctg gtg aag cct tcg gAg acc ctg tcc
     ctc acc tgc Act gtc tct ggt ggc tcc atc agT
4-61 cag gtg cag ctg cag gag tcg ggc cca gga ctg gtg aag cct tcg gAg acc ctg tcc
     ctc acc tgc Act gtc tct ggt ggc tcc Gtc agc
4-b  cag gtg cag ctg cag gag tcg ggc cca gga ctg gtg aag cct tcg gAg acc ctg tcc
     ctc acc tgc gct gtc tct ggt TAc tcc atc agc

VH5
5-51 GAG GTG CAG CTG GTG CAG TCT GGA GCA GAG GTG AAA AAG CCC GGG GAG TCT CTG AAG
     ATC TCC TGT AAG GGT TCT GGA TAC AGC TTT ACC
5-a  gaA gtg cag ctg gtg cag tct gga gca gag gtg aaa aag ccc ggg gag tct ctg aGg
     atc tcc tgt aag ggt tct gga tac agc ttt acc

VH6
6-1  CAG GTA CAG CTG CAG CAG TCA GGT CCA GGA CTG GTG AAG CCC TCG CAG ACC CTC TCA
     CTC ACC TGT GCC ATC TCC GGG GAC AGT GTC TCT

VH7
7-4.1 CAG GTG CAG CTG GTG CAA TCT GGG TCT GAG TTG AAG AAG CCT GGG GCC TCA GTG AAG
     GTT TCC TGC AAG GCT TCT GGA TAC ACC TTC ACT



Table 7: RERS sites in Human HC GLG FR1s where there are at least 20 GLGs cut
BsgI GTGCAG 71 (cuts 16/14 bases to right)
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There are 39 hits at base# 4 

There are 21 hits at base# 65 
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There are 43 hits at base# 6 Bolded sites very near sites 

listed below 

There are 21 hits at base# 67 
-"- gctgc 13 
37: 9 38: 9 39: 9 40: 3 40: 9 41: 9 
42: 9 44: 3 44: 9 45: 9 46: 9 47: 9 
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50: 9 

There are 11 hits at base# 9 
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There are 43 hits at base# 6 Often together. 
There are 11 hits at base# 9 
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There are 2 hits at base# 3 
There are 1 hits at base# 12 

There are 21 hits at base# 67 
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25 There are 46 hits at base# 7 
There are 2 hits at base# 1 
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There are 48 hits at base# 8 

There are 2 hits at base# 2 
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There are 22 hits at base# 52 52 and 48 never together. 
There are 9 hits at base# 48 
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There are 22 hits at base# 80 80 and 86 never together 
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There are 35 hits at base# 39 39 and 40 together twice. 
There are 2 hits at base# 40 
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There are 32 hits at base# 40 40 and 41 together twice 

There are 2 hits at base# 41 

There are 9 hits at base# 24 

There are 2 hits at base# 47 
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There are 33 hits at base# 40 
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47: 25 48: 40 48: 41 49: 40 49: 41 50: 25 
50: 74 51: 40 

There are 35 hits at base# 40 
There are 2 hits at base# 41 
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There are 22 hits at base# 46 46 and 43 never together 

There are 11 hits at base# 43 
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There are 17 hits at base# 46 
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46: 21 46: 22 47: 21 47: 22 50: 22 51: 44 
There are 23 hits at base# 47 These do not occur together.
There are 11 hits at base# 44 

There are 14 hits at base# 22 These do occur together. 
There are 9 hits at base# 21 
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There are 11 hits at base# 70 
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There are 20 hits at base# 48 
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There are 23 hits at base# 47 44 & 47 never together 
There are 4 hits at base# 44 
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30: 46 31: 46 32; 46 33: 46 34: 46 35: 46 
36: 46 37: 46 43: 79 

There are 22 hits at base# 46 43 and 46 never occur together. 

There are 4 hits at base# 43 
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There are 34 hits at base# 16 
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20: 77 21: 77 22: 77 23: 77 24: 77 25: 77 
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26: 77 27: 77 28: 77 29: 77 31: 77 32: 77 
33: 77 35: 77 36: 77 
There are 21 hits at base# 77 
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There are 34 hits at base# 16 
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33: 56 35: 56 
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There are 34 hits at base# 16 
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15: 56 16: 56 17: 56 18: 56 19: 56 20: 56 
21: 56 22: 56 23: 56 24: 56 25: 56 26: 56 
27: 56 28: 56 29: 56 30: 56 31: 56 32: 56 
33: 56 35: 56 36: 56 
There are 21 hits at base# 56 
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21: 68 22: 68 23: 68 24: 68 25: 68 26: 68 
27: 68 28: 68 29: 68 30: 68 31: 68 32: 68 
33: 68 34: 68 35: 68 36: 68 39: 46 40: 46 
41: 46 42: 46 

There are 22 hits at base# 68 
CCA CTC TCC CTG CCC
GTC ACC CTT GGA CAG CCG GCC TCC ATC TCC TGC ! A17
GAT GTT GTG ATG ACT CAG TCT CCA CTC TCC CTG CCC
GTC ACC CTT GGA CAG CCG GCC TCC ATC TCC TGC ! A1
GAT ATT GTG ATG ACC CAG ACT CCA CTC TCT CTG TCC
GTC ACC CCT GGA CAG CCG GCC TCC ATC TCC TGC ! A18
GAT ATT GTG ATG ACC CAG ACT CCA CTC TCT CTG TCC
GTC ACC CCT GGA CAG CCG GCC TCC ATC TCC TGC ! A2
GAT ATT GTG ATG ACT CAG TCT CCA CTC TCC CTG CCC
GTC ACC CCT GGA GAG CCG GCC TCC ATC TCC TGC ! A19
GAT ATT GTG ATG ACT CAG TCT CCA CTC TCC CTG CCC
GTC ACC CCT GGA GAG CCG GCC TCC ATC TCC TGC ! A3
GAT ATT GTG ATG ACC CAG ACT CCA CTC TCC TCA CCT
GTC ACC CTT GGA CAG CCG GCC TCC ATC TCC TGC ! A23
GAA ATT GTG TTG ACG CAG TCT CCA GGC ACC CTG TCT
TTG TCT CCA GGG GAA AGA GCC ACC CTC TCC TGC ! A27
GAA ATT GTG TTG ACG CAG TCT CCA GCC ACC CTG TCT
TTG TCT CCA GGG GAA AGA GCC ACC CTC TCC TGC ! A11
GAA ATA GTG ATG ACG CAG TCT CCA GCC ACC CTG TCT
GTG TCT CCA GGG GAA AGA GCC ACC CTC TCC TGC ! L2
GAA ATA GTG ATG ACG CAG TCT CCA GCC ACC CTG TCT
GTG TCT CCA GGG GAA AGA GCC ACC CTC TCC TGC ! L16
GAA ATT GTG TTG ACA CAG TCT CCA GCC ACC CTG TCT
TTG TCT CCA GGG GAA AGA GCC ACC CTC TCC TGC ! L6
GAA ATT GTG TTG ACA CAG TCT CCA GCC ACC CTG TCT
TTG TCT CCA GGG GAA AGA GCC ACC CTC TCC TGC ! L20
GAA ATT GTA ATG ACA CAG TCT CCA GCC ACC CTG TCT
TTG TCT CCA GGG GAA AGA GCC ACC CTC TCC TGC ! L25
GAC ATC GTG ATG ACC CAG TCT CCA GAC TCC CTG GCT
GTG TCT CTG GGC GAG AGG GCC ACC ATC AAC TGC ! B3
GAA ACG ACA CTC ACG CAG TCT CCA GCA TTC ATG TCA
GCG ACT CCA GGA GAC AAA GTC AAC ATC TCC TGC ! B2
GAA ATT GTG CTG ACT CAG TCT CCA GAC TTT CAG TCT
GTG ACT CCA AAG GAG AAA GTC ACC ATC ACC TGC ! A26
GAA ATT GTG CTG ACT CAG TCT CCA GAC TTT CAG TCT
GTG ACT CCA AAG GAG AAA GTC ACC ATC ACC TGC ! A10
GAT GTT GTG ATG ACA CAG TCT CCA GCT TTC CTC TCT
GTG ACT CCA GGG GAG AAA GTC ACC ATC ACC TGC ! A14



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Table 10 Lambda FR1 GLG sequences 
! VL1
CAG TCT GTG CTG ACT CAG CCA CCC TCG GTG TCT GAA
GCC CCC AGG CAG AGG GTC ACC ATC TCC TGT ! 1a
cag tct gtg ctg acG cag ccG ccc tcA gtg tct gGG
gcc ccA Ggg cag agg gtc acc atc tcc tgC ! 1e
cag tct gtg ctg act cag cca ccc tcA gCg tct gGG
Acc ccc Ggg cag agg gtc acc atc tcT tgt ! 1c
cag tct gtg ctg act cag cca ccc tcA gCg tct gGG
Acc ccc Ggg cag agg gtc acc atc tcT tgt ! 1g
cag tct gtg Ttg acG cag ccG ccc tcA gtg tct gCG
gcc ccA GgA cag aAg gtc acc atc tcc tgC ! 1b

! VL2
CAG TCT GCC CTG ACT CAG CCT CCC TCC GCG TCC GGG
TCT CCT GGA CAG TCA GTC ACC ATC TCC TGC ! 2c
cag tct gcc ctg act cag cct cGc tcA gTg tcc ggg
tct cct gga cag tca gtc acc atc tcc tgc ! 2e
cag tct gcc ctg act cag cct Gcc tcc gTg tcT ggg
tct cct gga cag tcG Atc acc atc tcc tgc ! 2a2
cag tct gcc ctg act cag cct ccc tcc gTg tcc ggg
tct cct gga cag tca gtc acc atc tcc tgc ! 2d
cag tct gcc ctg act cag cct Gcc tcc gTg tcT ggg
tct cct gga cag tcG Atc acc atc tcc tgc ! 2b2

! VL3
TCC TAT GAG CTG ACT CAG CCA CCC TCA GTG TCC GTG
TCC CCA GGA CAG ACA GCC AGC ATC ACC TGC ! 3r
tcc tat gag ctg act cag cca cTc tca gtg tcA gtg
Gcc cTG gga cag acG gcc agG atT acc tgT ! 3j
tcc tat gag ctg acA cag cca ccc tcG gtg tcA gtg
tcc cca gga caA acG gcc agG atc acc tgc ! 3p
tcc tat gag ctg acA cag cca ccc tcG gtg tcA gtg
tcc cTa gga cag aTG gcc agG atc acc tgc ! 3a
tcT tCt gag ctg act cag GAC ccT GcT gtg tcT gtg
Gcc TTG gga cag aca gTc agG atc acA tgc ! 3l
tcc tat gTg ctg act cag cca ccc tca gtg tcA gtg
Gcc cca gga Aag acG gcc agG atT acc tgT ! 3h
tcc tat gag ctg acA cag cTa ccc tcG gtg tcA gtg
tcc cca gga cag aca gcc agG atc acc tgc ! 3e
tcc tat gag ctg aTG cag cca ccc tcG gtg tcA gtg
tcc cca gga cag acG gcc agG atc acc tgc ! 3m
tcc tat gag ctg acA cag cca Tcc tca gtg tcA gtg
tcT ccG gga cag aca gcc agG atc acc tgc ! V2-19

! VL4
CTG CCT GTG CTG ACT CAG CCC CCG TCT GCA TCT GCC
TTG CTG GGA GCC TCG ATC AAG CTC ACC TGC ! 4c
cAg cct gtg ctg act caA TcA TcC tct gcC tct gcT
tCC ctg gga Tcc tcg Gtc aag ctc acc tgc ! 4a
cAg cTt gtg ctg act caA TcG ccC tct gcC tct gcc
tCC ctg gga gcc tcg Gtc aag ctc acc tgc ! 4b

! VL5
CAG CCT GTG CTG ACT CAG CCA CCT TCC TCC TCC GCA
TCT CCT GGA GAA TCC GCC AGA CTC ACC TGC ! 5e
cag Gct gtg ctg act cag ccG Gct tcc CTc tcT gca
tct cct gga gCa tcA gcc agT ctc acc tgc ! 5c
cag cct gtg ctg act cag cca Tct tcc CAT tcT gca
tct Tct gga gCa tcA gTc aga ctc acc tgc ! 5b

! VL6
AAT TTT ATG CTG ACT CAG CCC CAC TCT GTG TCG GAG
TCT CCG GGG AAG ACG GTA ACC ATC TCC TGC ! 6a

! VL7
CAG ACT GTG GTG ACT CAG GAG CCC TCA CTG ACT GTG
TCC CCA GGA GGG ACA GTC ACT CTC ACC TGT ! 7a
cag Gct gtg gtg act cag gag ccc tca ctg act gtg
tcc cca gga ggg aca gtc act ctc acc tgt ! 7b

! VL8
CAG ACT GTG GTG ACC CAG GAG CCA TCG TTC TCA GTG
TCC CCT GGA GGG ACA GTC ACA CTC ACT TGT ! 8a
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! VL9 

CAG CCT GTG CTG ACT CAG CCA CCT TCT GCA TCA GCC 

TCC CTG GGA GCC TCG GTC ACA CTC ACC TGC ! 9a 

! VL10 

CAG GCA GGG CTG ACT CAG CCA CCC TCG GTG TCC AAG

GGC TTG AGA CAG ACC GCC ACA CTC ACC TGC ! 10a 






Table 11 RERSs found in human lambda FR1 GLGs
! There are 31 lambda GLGs
MlyI NnnnnnGACTC 25
1: 6 3: 6 4: 6 6: 6 7: 6 8: 6
9: 6 10: 6 11: 6 12: 6 15: 6 16: 6
20: 6 21: 6 22: 6 23: 6 23: 50 24: 6
25: 6 25: 50 26: 6 27: 6 28: 6 30: 6
31: 6
There are 23 hits at base# 6

-"- GAGTCNNNNNn 1
26: 34



MwoI GCNNNNNnngc 20
1: 9 2: 9 3: 9 4: 9 11: 9 11: 56
12: 9 13: 9 14: 9 16: 9 17: 9 18: 9
19: 9 20: 9 23: 9 24: 9 25: 9 26: 9
30: 9 31: 9
There are 19 hits at base# 9
HinfI Gantc 27
1: 12 3: 12 4: 12 6: 12 7: 12 8: 12
9: 12 10: 12 11: 12 12: 12 15: 12 16: 12
20: 12 21: 12 22: 12 23: 12 23: 46 23: 56
24: 12 25: 12 25: 56 26: 12 26: 34 27: 12
28: 12 30: 12 31: 12
There are 23 hits at base# 12
PleI gactc 25
1: 12 3: 12 4: 12 6: 12 7: 12 8: 12
9: 12 10: 12 11: 12 12: 12 15: 12 16: 12
20: 12 21: 12 22: 12 23: 12 23: 56 24: 12
25: 12 25: 56 26: 12 27: 12 28: 12 30: 12
31: 12
There are 23 hits at base# 12

-"- gagtc 1
26: 34



DdeI Ctnag
1: 14 2: 24 3: 14 3: 24 4: 14 4: 24
5: 24 6: 14 7: 14 7: 24 8: 14 9: 14
10: 14 11: 14 11: 24 12: 14 12: 24 15: 5
15: 14 16: 14 16: 24 19: 24 20: 14 23: 14
24: 14 25: 14 26: 14 27: 14 28: 14 29: 30
30: 14 31: 14
There are 21 hits at base# 14



BsaJI Ccnngg 38
1: 23 1: 40 2: 39 2: 40 3: 39 3: 40
4: 39 4: 40 5: 39 11: 39 12: 38 12: 39
13: 23 13: 39 14: 23 14: 39 15: 38 16: 39
17: 23 17: 39 18: 23 18: 39 21: 38 21: 39
21: 47 22: 38 22: 39 22: 47 26: 40 27: 39
28: 39 29: 14 29: 39 30: 38 30: 39 30: 47
31: 23 31: 32
There are 17 hits at base# 39
There are 5 hits at base# 38
There are 5 hits at base# 40 Makes cleavage ragged.

MnlI cctc 35
1: 23 2: 23 3: 23 4: 23 5: 23 6: 19
6: 23 7: 19 8: 23 9: 19 9: 23 10: 23
11: 23 13: 23 14: 23 16: 23 17: 23 18: 23
19: 23 20: 47 21: 23 21: 29 21: 47 22: 23
22: 29 22: 35 22: 47 23: 26 23: 29 24: 27
27: 23 28: 23 30: 35 30: 47 31: 23
There are 21 hits at base# 23
There are 3 hits at base# 19
There are 3 hits at base# 29
There are 1 hits at base# 26
There are 1 hits at base# 27 These could make cleavage ragged.

-"- gagg 7
1: 48 2: 48 3: 48 4: 48 27: 44 29: 44


















BssKI Nccngg 39
1: 40 2: 39 3: 39 3: 40 4: 39 5: 39
6: 31 6: 39 7: 31 7: 39 9: 31 9: 39
10: 39 11: 39 12: 38 13: 39 13: 52 14: 52
16: 39 16: 52 17: 52 18: 39 18: 52 19: 39
19: 52 22: 38 23: 39 24: 39 26: 39 27: 39
29: 14 29: 39 30: 38
There are 21 hits at base# 39
There are 4 hits at base# 38
There are 3 hits at base# 31
There are 3 hits at base# 40 Ragged



BstNI CCwgg
1: 41 2: 40 5: 40 6: 40 7: 40 9: 40
10: 40 11: 40 12: 39 12: 53 13: 53 14: 53
16: 40 16: 53 17: 40 18: 40 18: 53 19: 53
21: 39 22: 39 24: 40 27: 40 28: 40 29: 15
29: 40
There are 17 hits at base# 40
There are 7 hits at base# 53
There are 4 hits at base# 39
There are 1 hits at base# 41 Ragged



PspGI ccwgg
1: 41 2: 40 5: 40 6: 40 7: 40 9: 40
10: 40 11: 40 12: 39 12: 53 13: 53 14: 53
16: 40 16: 53 17: 40 18: 40 18: 53 19: 53
21: 39 22: 39 24: 40 27: 40 28: 40 29: 15
29: 40
There are 17 hits at base# 40
There are 7 hits at base# 53
There are 4 hits at base# 39
There are 1 hits at base# 41



ScrFI CCngg 39
1: 41 2: 40 3: 40 3: 41 4: 40 4: 41
5: 40 6: 32 6: 40 7: 32 7: 40 8: 40
9: 32 9: 40 10: 40 11: 40 12: 39 12: 53
13: 40 13: 53 14: 53 16: 40 16: 53 17: 40
17: 53 18: 40 18: 53 19: 40 19: 53 21: 39
22: 39 23: 40 24: 40 26: 40 27: 40 28: 40
29: 15 29: 40 30: 39
There are 21 hits at base# 40
There are 4 hits at base# 39
There are 3 hits at base# 41

MaeIII gtnac 16
1: 52 2: 52 3: 52 4: 52 5: 52 6: 52
7: 52 9: 52 26: 52 27: 10 27: 52 28: 10
28: 52 29: 10 29: 52 30: 52
There are 13 hits at base# 52

Tsp45I gtsac 15
1: 52 2: 52 3: 52 4: 52 5: 52 6: 52
7: 52 9: 52 27: 10 27: 52 28: 10 28: 52
29: 10 29: 52 30: 52
There are 12 hits at base# 52



HphI tcacc 26
1: 53 2: 53 3: 53 4: 53 5: 53 6: 53
7: 53 8: 53 9: 53 10: 53 11: 59 13: 59
14: 59 17: 59 18: 59 19: 59 20: 59 21: 59
22: 59 23: 59 24: 59 25: 59 27: 59 28: 59
30: 59 31: 59
There are 16 hits at base# 59
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There are 10 hits at base# 53 

BspMI ACCTGCNNNNn 14
11: 61 13: 61 14: 61 17: 61 18: 61 19: 61
20: 61 21: 61 22: 61 23: 61 24: 61 25: 61
30: 61 31: 61

There are 14 hits at base# 61 Goes into CDR1 
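
The "sequence: base#" pairs in Table 11 can be reproduced by scanning each FR1 sequence with the recognition pattern. The following is a minimal sketch, assuming standard IUPAC codes, 1-based positions of the first base of the site, and that sequence 1 of Table 11 is 1a from Table 10. `find_sites` is an illustrative name, not something defined in the text.

```python
import re

# IUPAC nucleotide codes expanded to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

def find_sites(pattern: str, seq: str):
    """Return 1-based start positions of every (possibly overlapping) match."""
    regex = "".join(IUPAC[ch] for ch in pattern.upper())
    s = seq.replace(" ", "").upper()
    return [m.start() + 1 for m in re.finditer(f"(?=({regex}))", s)]

# Lambda GLG 1a, copied from Table 10.
seq_1a = ("CAG TCT GTG CTG ACT CAG CCA CCC TCG GTG TCT GAA "
          "GCC CCC AGG CAG AGG GTC ACC ATC TCC TGT")

for enzyme, pattern in [("MlyI", "NNNNNNGACTC"), ("PleI", "GACTC"),
                        ("DdeI", "CTNAG"), ("BsaJI", "CCNNGG"),
                        ("MaeIII", "GTNAC"), ("HphI", "TCACC")]:
    print(enzyme, find_sites(pattern, seq_1a))
```

For 1a this gives the same positions as the entries for sequence 1 in the table (for example 6 for MlyI, 12 for PleI, 14 for DdeI, and 23 and 40 for BsaJI).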
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Table 12: Matches to URE FR3 adapters in 79 human HC. 
A. List of heavy-chain genes sampled

          AF103367  HSA235674  HSU94417  S83240
          AF103368  HSA235673  HSU94418  SABVH369
          AF103369  HSA240559  HSU96389  SADEIGVH
          AF103370  HSCB201    HSU96391  SAH2IGVH
          AF103371  HSIGGVHC   HSU96392  SDA3IGVH
AF103072  AF103372  HSU44791   HSU96395  SIGVHTTD
AF103078  AF158381  HSU44793   HSZ93849  SUK4IGVH
AF103099  E05213    HSU82771   HSZ93850
AF103102  E05886    HSU82949   HSZ93851
AF103103  E05887    HSU82950   HSZ93853
AF103174  HSA235661 HSU82952   HSZ93855
AF103186  HSA235664 HSU82961   HSZ93857
AF103187  HSA235660 HSU86522   HSZ93860
AF103195  HSA235659 HSU86523   HSZ93863
AF103277  HSA235678 HSU92452   MCOMFRAA
AF103286  HSA235677 HSU94412   MCOMFRVA
AF103309  HSA235676 HSU94415   S82745
AF103343  HSA235675 HSU94416   S82764




Table 12B. Testing all distinct GLGs from bases 89.1 to 93.2

Id Nb 0 1 2 3 4
1 38 15 11 10 0 2 Seq1 gtgtattactgtgc (SEQ ID NO: 25)
2 19 7 6 4 2 0 Seq2 gtAtattactgtgc (SEQ ID NO: 26)
3 1 0 0 1 0 0 Seq3 gtgtattactgtAA (SEQ ID NO: 27)
4 7 1 5 1 0 0 Seq4 gtgtattactgtAc (SEQ ID NO: 28)
5 0 0 0 0 0 0 Seq5 Ttgtattactgtgc (SEQ ID NO: 29)
6 0 0 0 0 0 0 Seq6 TtgtatCactgtgc (SEQ ID NO: 30)
7 3 1 0 1 1 0 Seq7 ACAtattactgtgc (SEQ ID NO: 31)
8 2 0 2 0 0 0 Seq8 ACgtattactgtgc (SEQ ID NO: 32)
9 9 2 2 4 1 0 Seq9 ATgtattactgtgc (SEQ ID NO: 33)
Group 26 26 21 4 2
Cumulative 26 52 73 77 79



Table 12C Most important URE recognition seqs in FR3 Heavy 

1 VHSzy1 GTGtattactgtgc (ON_SHC103) (SEQ ID NO: 25)

2 VHSzy2 GTAtattactgtgc (ON_SHC323) (SEQ ID NO: 26) 

3 VHSzy4 GTGtattactgtac (ON_SHC349) (SEQ ID NO: 28) 

4 VHSzy9 ATGtattactgtgc (ON_SHC5a) (SEQ ID NO: 33) 

Table 12D, testing 79 human HC V genes with four probes 

Number of sequences 79 

Number of bases 29143 

             Number of mismatches 
Id  Best     0   1   2   3   4   5 

 1   39     15  11  10   1   2   0   Seq1 gtgtattactgtgc (SEQ ID NO: 25) 
 2   22      7   6   5   3   0   1   Seq2 gtAtattactgtgc (SEQ ID NO: 26) 
 3    7      1   5   1   0   0   0   Seq4 gtgtattactgtAc (SEQ ID NO: 28) 
 4   11      2   4   4   1   0   0   Seq9 ATgtattactgtgc (SEQ ID NO: 33) 

Group       25  26  20   5   2 
Cumulative  25  51  71  76  78 



One sequence has five mismatches with sequences 2, 4, and 9; 
it is scored as best for 2. 

Id is the number of the adapter. 

Best is the number of sequences for which the identified 
adapter was the best available. 

The rest of the table shows how well the sequences match the 
adapters. For example, there are 10 sequences that match 
VHSzy1 (Id=1) with 2 mismatches and are worse for all other 
adapters. In this sample, 90% (71 of 79) come within 2 bases 
of one of the four adapters. 
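
The scoring described above can be sketched in Python. This is a minimal illustration, not the procedure actually used to build Table 12D: the function names are hypothetical, the comparison is a plain position-by-position mismatch count over the 14-base recognition region, and ties are assumed to go to the lowest-numbered adapter (consistent with the note that the sequence tied between 2, 4, and 9 is scored as best for 2).

```python
from collections import Counter

# The four adapter recognition sequences of Table 12D
# (SEQ ID NO: 25, 26, 28, 33), keyed by adapter Id.
ADAPTERS = {
    1: "gtgtattactgtgc",
    2: "gtatattactgtgc",
    3: "gtgtattactgtac",
    4: "atgtattactgtgc",
}


def mismatches(a, b):
    """Count positions at which two equal-length sequences differ (case-insensitive)."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a.lower(), b.lower()))


def best_adapter(target):
    """Return (adapter Id, mismatch count) for the best adapter.

    Assumption: ties are broken in favour of the lowest adapter Id.
    """
    scored = ((mismatches(seq, target), adapter_id)
              for adapter_id, seq in ADAPTERS.items())
    count, adapter_id = min(scored)
    return adapter_id, count


def tally(targets):
    """Build the body of a Table 12D-style tally: (Id, mismatches) -> number of targets."""
    return Counter(best_adapter(t) for t in targets)


def fraction_within(table, max_mismatches):
    """Fraction of all targets whose best adapter has at most max_mismatches."""
    total = sum(table.values())
    hits = sum(n for (_, mm), n in table.items() if mm <= max_mismatches)
    return hits / total
```

Applied to the cumulative row of Table 12D, the same ratio is 71/79, which rounds to the 90% quoted in the text.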
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Table 13 

The following list of enzymes was taken from 
http://rebase.neb.com/cgi-bin/asymmlist. 

I have removed the enzymes that a) cut within the recognition, b) cut 
both sides of the recognition, or c) have fewer than 2 bases between 
recognition and closest cut site. 



REBASE Enzymes 
04/13/2001 



Type II restriction enzymes with asymmetric recognition sequences 

Enzymes    Recognition Sequence                       Isoschizomers 
AarI       CACCTGCNNNN^NNNN_ 
AceIII     CAGCTCNNNNNNN^NNNN_ 
Bbr7I      GAAGACNNNNNNN^NNNN_ 
BbvI       GCAGCNNNNNNNN^NNNN_ 
BbvII      GAAGACNN^NNNN_ 
Bce83I     CTTGAGNNNNNNNNNNNNNN_NN^ 
BceAI      ACGGCNNNNNNNNNNNN^NN_ 
BcefI      ACGGCNNNNNNNNNNNN^N_ 
BciVI      GTATCCNNNNN_N^                             BfuI 
BfiI       ACTGGGNNNN_N^                              BmrI 
BinI       GGATCNNNN^N_ 
BscAI      GCATCNNNN^NN_ 
BseRI      GAGGAGNNNNNNNN_NN^ 
BsmFI      GGGACNNNNNNNNNN^NNNN_                      BspLU11III 
BspMI      ACCTGCNNNN^NNNN_                           Acc36I 
EciI       GGCGGANNNNNNNNN_NN^ 
Eco57I     CTGAAGNNNNNNNNNNNNNN_NN^                   BspKT5I 
FauI       CCCGCNNNN^NN_                              BstFZ438I 
FokI       GGATGNNNNNNNNN^NNNN_                       BstPZ418I 
GsuI       CTGGAGNNNNNNNNNNNNNN_NN^ 
HgaI       GACGCNNNNN^NNNNN_ 
HphI       GGTGANNNNNNN_N^                            AsuHPI 
MboII      GAAGANNNNNNN_N^ 
MlyI       GAGTCNNNNN^                                SchI 
MmeI       TCCRACNNNNNNNNNNNNNNNNNN_NN^ 
MnlI       CCTCNNNNNN_N^ 
PleI       GAGTCNNNN^N_                               PpsI 
RleAI      CCCACANNNNNNNNN_NNN^ 
SfaNI      GCATCNNNNN^NNNN_                           BspST5I 
SspD5I     GGTGANNNNNNNN^ 
Sth132I    CCCGNNNN^NNNN_ 
StsI       GGATGNNNNNNNNNN^NNNN_ 
TaqII      GACCGANNNNNNNNN_NN^, CACCCANNNNNNNNN_NN^ 
Tth111II   CAARCANNNNNNNNN_NN^ 
UbaPI      CGAACG 



Suppliers 

y 



y 
y 



y 
y 
y 
y 
y 
y 
y 
y 
y 
y 
y 
y 

y 
y 



The notation is ^ means cut the upper strand and _ means cut the lower 
strand. If the upper and lower strand are cut at the same place, then only 
^ appears. 
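
As an illustration of this notation, the following Python sketch converts one entry of the recognition-sequence column into a recognition site plus the upper- and lower-strand cut distances. It is a sketch under stated assumptions, not part of the REBASE listing: the function names are hypothetical, the upper-strand mark is written as ^, the recognition site is taken to be everything before the first N, and entries that list two sites separated by a comma (such as TaqII) must be split by the caller.

```python
def parse_cut_notation(site):
    """Parse e.g. 'GGATGNNNNNNNNN^NNNN_' into (recognition, top_cut, bottom_cut).

    Cut positions are counted in bases downstream of the recognition site.
    If only '^' appears, both strands are cut at the same place.
    If no mark appears, both cut positions are None.
    """
    letters = []
    top = bottom = None
    for ch in site:
        if ch == "^":
            top = len(letters)      # bases read so far on the upper strand
        elif ch == "_":
            bottom = len(letters)   # same count marks the lower-strand cut
        else:
            letters.append(ch)
    seq = "".join(letters)
    first_n = seq.find("N")
    recognition = seq if first_n < 0 else seq[:first_n]
    if top is not None and bottom is None:
        bottom = top
    if top is None:
        return recognition, None, None
    return recognition, top - len(recognition), bottom - len(recognition)


def passes_spacing_filter(site, min_gap=2):
    """Criterion (c) of the list: at least min_gap bases between the
    recognition site and the closest cut site."""
    _, top, bottom = parse_cut_notation(site)
    if top is None:
        return False
    return min(top, bottom) >= min_gap
```

For the FokI entry this yields cut distances of 9 (upper strand) and 13 (lower strand), which matches the FokI cleavage diagram in Table 15.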
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Table 15: Use of Fokl as "Universal Restriction Enzyme" 

FokI - for dsDNA, | represents sites of cleavage 

                     sites of cleavage 
5'-cacGGATGtg--nnnnnnn|nnnnnnn-3' (SEQ ID NO: 15) 
3'-gtgCCTACac--nnnnnnnnnnn|nnn-5' (SEQ ID NO: 16) 
      RECOGNITion of FokI 



Case I 

5'-. . .gtgi tatt-actgtgc. .Substrate -3 r (SEQ ID NO:17) 

10 3 1 -cac- ataa ) tqacacq — > 

C[tGTAGGcac\ 
5'- caCATCCgtg/ (SEQ ID NO: 18) 

Case II 

5 1 - . . . gtgtatt | agac-tgc. . Substrate -3 1 (SEQ ID NO:19) 

15 i — cacataa -tctq 1 acq-5 ' 
/gtgCCTACac 

\cacGGATGtg~3 ' (SEQ ID NO:20) 

Case III (Case I rotated 180 degrees) 

/gtgCCTACac-5' 
20 \ ca cGGATGt£— j 

gtgtctt I acaq-tcc-3 1 Adapter (SEQ ID NO: 21) 
3 T -. . .cacagaa-tgtcjagg. .substrate. . . .-5 1 (SEQ ID NO: 22) 

Case IV (Case II rotated 180 degrees) 

3'- gtGTAGGcac\ (SEQ ID NO: 23) 
2 5 (—ca CAT C Cg t g/ 

5 1 -gag | tctc-actgaqc 
Substrate 3 ' - . . . ctc-agag ( tgactcg. . . -5 ' (SEQ ID NO:24) 



Improved FokI adapters 

FokI - for dsDNA, | represents sites of cleavage 

Case I 

Stem 11, loop 5, stem 11, recognition 17 

5 1 - . . . catgtg | tatt-actgtgc . . Substrate .... -3 ' 
3 ' -gtacac- ataa I tgacacq — i r T— ] 

qtGTAGGcacG T 

35 5'- caCATCCgtgc C 
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Case II 

Stem 10, loop 5, stem 10, recognition 18 

5'-. . . gtgtatt|agac-tgctgcc. .Substrate. . . .-3' 
r T-| [ —cacataa -tctql acgacgg-5 1 

T gtgCCTACac 
C cacGGATGtg-3 1 

Case III (Case I rotated 180 degrees) 
Stem 11, loop 5, stem 11, recognition 20 

r T i 

T TgtgCCTACac-5 ' 
G Ac a cGGAT Gtq— i 

Ltt j qtqtctt 1 acaq-tccattctg-3 ' Adapter 

3 1 -. . .cacagaa-tgtc|aggtaagac. .substrate. . . .-5 

Case IV (Case II rotated 180 degrees) 
Stem 11, loop 4, stem 11, recognition 17 

fTi 

3'- gtGTAGGcacc T 
,-caCATCCgtgg T 
5 1 -atcaaa I tctc-actqaqc L T J 
Substrate 3'-. . . tagctc-agag | tgactcg . . .-5' 



BseRI 

                      | sites of cleavage 
5'-cacGAGGAGnnnnnnnnnn|nnnnn-3' 
3'-gtgctcctcnnnnnnnn|nnnnnnn-5' 
      RECOGNITion of BseRI 

Stem 11, loop 5, stem 11, recognition 19 

3 1 - gaacat | cg-ttaagccagta 5 * 

[-T--T-! cttgta-gc | aattcggtcat~3 1 

C GCTGAGGAGTC— ^ 

T cgactcctcag-5' An adapter for BseRI to cleave the 
L T I 
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What happens in the top strand: 

!                           | site of cleavage in the upper strand 
(VL133-2a2*) 5'-g tct cct g | ga cag tcg atc 
(VL133-31*)  5'-g gcc ttg g | ga cag aca gtc 
(VL133-2c*)  5'-g tct cct g | ga cag tca gtc 
(VL133-1c*)  5'-g gcc cca g | gg cag agg gtc 

! The following Extenders and Bridges all encode the AA sequence of 2a2 for codons 1-15 

!                                           1 
(ON_LamEx133)       5'-ccTcTgAcTgAgT gcA cAg - 
!    2   3   4   5   6   7   8   9  10  11  12 
    AGt gcT TtA acC caA ccG gcT AGT gtT AGC ggT- 
!   13  14  15 
    tcC ccG g ! 2a2 

!                                           1 
(ON_LamB1-133) [RC] 5'-ccTcTgAcTgAgT gcA cAg - 
!    2   3   4   5   6   7   8   9  10  11  12 
    AGt gcT TtA acC caA ccG gcT AGT gtT AGC ggT- 
!   13  14  15 
    tcC ccG g ga cag tcg at-3' ! 2a2 N.B. the actual seq is the 
                                     reverse complement of the 
                                     one shown. 

(ON_LamB2-133) [RC] 5'-ccTcTgAcTgAgT gcA cAg - 
!    2   3   4   5   6   7   8   9  10  11  12 
    AGt gcT TtA acC caA ccG gcT AGT gtT AGC ggT- 
!   13  14  15 
    tcC ccG g ga cag aca gt-3' ! 31 N.B. the actual seq is the 
                                    reverse complement of the 
                                    one shown. 

(ON_LamB3-133) [RC] 5'-ccTcTgAcTgAgT gcA cAg - 
!    2   3   4   5   6   7   8   9  10  11  12 
    AGt gcT TtA acC caA ccG gcT AGT gtT AGC ggT- 
!   13  14  15 
    tcC ccG g ga cag tca gt-3' ! 2c N.B. the actual seq is the 
                                    reverse complement of the 
                                    one shown. 

(ON_LamB4-133) [RC] 5'-ccTcTgAcTgAgT gcA cAg - 
!    2   3   4   5   6   7   8   9  10  11  12 
    AGt gcT TtA acC caA ccG gcT AGT gtT AGC ggT- 
!   13  14  15 
    tcC ccG g gg cag agg gt-3' ! 1c N.B. the actual seq is the 
                                    reverse complement of the 
                                    one shown. 

(ON_Lam133PCR) 5'-ccTcTgAcTgAgT gcA cAg AGt gc-3' 
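
The bridge oligonucleotides above are annotated as being, in reality, the reverse complement of the sequence shown. A minimal Python sketch of that conversion follows; the function name is hypothetical, and letter case is preserved so that the capitalisation used in the listing carries over to the result.

```python
# Complement table for upper- and lower-case bases; N is left as N.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'.

    Spaces used to separate codons in the listing are dropped first.
    """
    compact = seq.replace(" ", "")
    return compact.translate(_COMPLEMENT)[::-1]


# The 3' end of the ON_LamB1-133 sequence as shown, codon spacing included.
shown_tail = "g ga cag tcg at"
actual_head = reverse_complement(shown_tail)  # 5' end of the oligo as synthesised
```

Taking the reverse complement twice returns the original sequence, which gives a quick check when transcribing these oligonucleotides.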






Table 19: Cleavage of 75 human light chains. 

Enzyme     Recognition*      Nch  Ns   Planned location of site 

AfeI       AGCgct             0    0 
AflII      Cttaag             0    0   HC FR3 
AgeI       Accggt             0    0 
AscI       GGcgcgcc           0    0   After LC 
BglII      Agatct             0    0 
BsiWI      Cgtacg             0    0 
BspDI      ATcgat             0    0 
BssHII     Gcgcgc             0    0 
BstBI      TTcgaa             0    0 
DraIII     CACNNNgtg          0    0 
EagI       Cggccg             0    0 
FseI       GGCCGGcc           0    0 
FspI       TGCgca             0    0 
HpaI       GTTaac             0    0 
MfeI       Caattg             0    0   HC FR1 
MluI       Acgcgt             0    0 
NcoI       Ccatgg             0    0   Heavy chain signal 
NheI       Gctagc             0    0   HC/anchor linker 
NotI       GCggccgc           0    0   In linker after HC 
NruI       TCGcga             0    0 
PacI       TTAATtaa           0    0 
PmeI       GTTTaaac           0    0 
PmlI       CACgtg             0    0 
PvuI       CGATcg             0    0 
SacII      CCGCgg             0    0 
SalI       Gtcgac             0    0 
SfiI       GGCCNNNNnggcc      0    0   Heavy chain signal 
SgfI       GCGATcgc           0    0 
SnaBI      TACgta             0    0 
StuI       AGGcct             0    0 
XbaI       Tctaga             0    0   HC FR3 
AatII      GACGTc             1    1 
AclI       AAcgtt             1    1 
AseI       ATtaat             1    1 
BsmI       GAATGCN            1    1 
BspEI      Tccgga             1    1   HC FR1 
BstXI      CCANNNNNntgg       1    1   HC FR2 
DrdI       GACNNNNnngtc       1    1 
HindIII    Aagctt             1    1 
PciI       Acatgt             1    1 
SapI       gaagagc            1    1 
ScaI       AGTact             1    1 
SexAI      Accwggt            1    1 
SpeI       Actagt             1    1 
TliI       Ctcgag             1    1 
XhoI       Ctcgag             1    1 
BcgI       cgannnnnntgc       2    2 
BlpI       GCtnagc            2    2 
BssSI      Ctcgtg             2    2 
BstAPI     GCANNNNntgc        2    2 
EspI       GCtnagc            2    2 
KasI       Ggcgcc             2    2 
PflMI      CCANNNNntgg        2    2 
XmnI       GAANNnnttc         2    2 
ApaLI      Gtgcac             3    3   LC signal seq 
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Nael 


GCCggc 




o 


NgoMI 


Gccggc 


•3 




PvuII 


CAGctg 


o 


Q 


RsrII 


CGgwccg 


Q 


o 


BsrBI 


GAGcgg 






BsrDI 


GCAATGNNn 


4 


4 


BstZ17I 


GTAtac 


A 


4 


EcoRI 


Gaattc 


A 


4 


SphI 


GCATGc 


4 


4 


Sspl 


AATatt 


A 


4 


AccI 


GTmkac 


C. 
J 


-J 


Bell 


Tgatca 




5 


BsmBI 


Nnnnnngagacg 




•J 


"Re rCT 


± y La i—GL 


5 


5 


DLdX 


TTTaaa 


6 


6 


wcei 




6 


6 HC FR4 


Swal 


Ai l i aaat 


6 


6 


BamHI 


Ggatcc 


I 


7 


SacI 


GAGCTc 


7 


7 


BciVI 


GT AT CCNNNNNN 


Q 
O 


Q 
0 


BsaBI 


GATNNnnatc 


o 
o 


ft 

o 


Nsil 


ATGCAt 


Q 

o 


Q 
0 


Bspl20I 


Gggccc 


Q 


9 CHI 


Apal 


GGGCCc 


y 


Q PHI 


PspOOMI 


Gggccc 


y 


Q 

y 


BspHI 


Tcatga 


y 


1 1 
ii 


EcoRV 


GATatc 


9 


y 


Ahdl 


GACNNNnngtc 


11 


11 


Bbsl 


GAAGAC 


ii 


i 4 

1 fi 


Psil 


TTAtaa 


±Z 


xz 


Bsal 


GGTCTCNnnnn 


1j 


1 S 
lj 


Xmal 


Cccggg 


1 o 

13 


14 


Aval 


Cycgrg 


14 


1 0 


Bgll 


GCCNNNNnggc 


14 


1 1 
1 / 


AlwNI 


CAGNNNctg 


lo 


ID 


BspMI 


ACCTGC 


1 7 
x i 




Xcml 


CCANNNNNnnnntgg 


i 7 
i / 


Z D 


BstEII 


Ggtnacc 




oo Tir* TTT34 


Sse8387I 


CCTGCAgg 


zU 


z u 


Avrll 


Cctagg 


zz 


zz 


Hindi 


GTYrac 


22 


zz 


Bsgl 


GTGCAG 


27 


29 


MscI 


TGGcca 


30 


34 


BseRI 


NNnnnnnnnnct cc t c 


32 


35 


Bsu36I 


CCtnagg 


Q C 
OJ 


o / 


PstI 


CTGCAg 


35 


40 


Ecil 


nnnnnnnnntccgcc 


38 


40 


PpuMI 


RGgwccy 


41 


50 


Styl 


Ccwwgg 


44 


73 


EcoOl09I 


RGgnccy 


46 


70 


Acc65l 


Ggtacc 


50 


51 


Kpnl 


GGTACc 


50 


51 


Bpml 


ctccag 


53 


82 


Avail 


Ggwcc 


71 


124 



* Cleavage occurs in the top strand after the last upper-case base. For REs 
that cut palindromic sequences, the lower strand is cut at the symmetrical 
site. 
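
A tally of this kind can be sketched in Python as follows. The sketch is illustrative only and is not the program used to produce the table: the function names are hypothetical, Nch is assumed to mean the number of chains containing at least one site and Ns the total number of sites, and non-palindromic recognition sequences are assumed to be searched on both strands. Degenerate bases are expanded using the standard IUPAC nucleotide codes.

```python
import re

# Standard IUPAC nucleotide codes expanded to regular-expression classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "W": "[AT]", "S": "[CG]",
    "M": "[AC]", "K": "[GT]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}
COMPLEMENT = str.maketrans("ACGTRYWSMKBDHVN", "TGCAYRWSKMVHDBN")


def cut_offset(recognition):
    """Top-strand cut position: just after the last upper-case base."""
    uppers = [i for i, ch in enumerate(recognition) if ch.isupper()]
    return uppers[-1] + 1 if uppers else 0


def site_pattern(recognition):
    """Regular expression matching one recognition sequence."""
    return "".join(IUPAC[ch] for ch in recognition.upper())


def count_sites(recognition, chains):
    """Return (Nch, Ns) for one enzyme over a list of chain sequences."""
    site = recognition.upper()
    reverse = site.translate(COMPLEMENT)[::-1]
    # A palindromic site gives the same pattern twice; the set removes it.
    patterns = {site_pattern(site), site_pattern(reverse)}
    # Lookahead so that overlapping occurrences are all counted.
    regexes = [re.compile("(?=" + p + ")") for p in patterns]
    nch = ns = 0
    for chain in chains:
        hits = sum(len(rx.findall(chain.upper())) for rx in regexes)
        if hits:
            nch += 1
            ns += hits
    return nch, ns
```

With this convention an enzyme showing Nch = 0 has no site in any of the tested chains, which is the property wanted for a site planned for use in the vector.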
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Table 20: Cleavage of 79 human heavy chains 

Enzyme     Recognition       Nch  Ns   Planned location of site 

AfeI       AGCgct             0    0 
AflII      Cttaag             0    0   HC FR3 
AscI       GGcgcgcc           0    0   After LC 
BsiWI      Cgtacg             0    0 
BspDI      ATcgat             0    0 
BssHII     Gcgcgc             0    0 
FseI       GGCCGGcc           0    0 
HpaI       GTTaac             0    0 
NheI       Gctagc             0    0   HC Linker 
NotI       GCggccgc           0    0   In linker, HC/anchor 
NruI       TCGcga             0    0 
NsiI       ATGCAt             0    0 
PacI       TTAATtaa           0    0 
PciI       Acatgt             0    0 
PmeI       GTTTaaac           0    0 
PvuI       CGATcg             0    0 
RsrII      CGgwccg            0    0 
SapI       gaagagc            0    0 
SfiI       GGCCNNNNnggcc      0    0   HC signal seq 
SgfI       GCGATcgc           0    0 
SwaI       ATTTaaat           0    0 
AclI       AAcgtt             1    1 
AgeI       Accggt             1    1 
AseI       ATtaat             1    1 
AvrII      Cctagg             1    1 
BsmI       GAATGCN            1    1 
BsrBI      GAGcgg             1    1 
BsrDI      GCAATGNNn          1    1 
DraI       TTTaaa             1    1 
FspI       TGCgca             1    1 
HindIII    Aagctt             1    1 
MfeI       Caattg             1    1 
NaeI       GCCggc             1    1 
NgoMI      Gccggc             1    1 
SpeI       Actagt             1    1 
Acc65I     Ggtacc             2    2 
BstBI      TTcgaa             2    2 
KpnI       GGTACc             2    2 
MluI       Acgcgt             2    2 
NcoI       Ccatgg             2    2   In HC signal seq 
NdeI       CAtatg             2    2   HC FR4 
PmlI       CACgtg             2    2 
XcmI       CCANNNNNnnnntgg    2    2 
BcgI       cgannnnnntgc       3    3 
BclI       Tgatca             3    3 
BglI       GCCNNNNnggc        3    3 
BsaBI      GATNNnnatc         3    3 
BsrGI      Tgtaca             3    3 
SnaBI      TACgta             3    3 
Sse8387I   CCTGCAgg           3    3 
ApaLI      Gtgcac             4    4   LC Signal/FR1 
BspHI      Tcatga             4    4 
BssSI      Ctcgtg             4    4 
PsiI       TTAtaa             4    5 
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•••-=■ 

iJl 





SphI 


GCATGc 


*i 


A 
4 






Ahdl 


GACNNNnngtc 


c; 
O 


c 






BspEX 


Tccgga 


c 

i> 


c 






MscI 


TGGcca 


5 


c 

O 




5 


Sad 


GAGCTc 


5 


C 
O 






Seal 


AGTact 


5 


5 






SexAI 


Accwggt 


r 

5 


b 






Sspl 


AATatt 


5 


5 






Tlil 


Ctcgag 


5 


5 




10 


Xhol 


Ctcgag 


5 


r 
D 






Bbsl 


GAAGAC 


7 


8 






BstAPI 


GCANNNNntgc 


1 


o 
o 






BstZ17I 


GTAtac 


7 


7 






EcoRV 


z-' 7\ rp _ 4- _, 

(jrAiaCC 


7 


7 




iO 


CjCOKI 




8 


8 






BlpI 


GCtnagc 


9 


9 






Bsu36I 


LL-unagg 


9 


9 






Dralll 


L.ACJNJMJNgL.g 


9 


9 






Espl 


GCtnagc 


9 


9 




o n 
zU 


StuI 


AbwCCu 


9 


13 






Xbal 


Tctaga 


Q 

.7 


9 


HC FR3 




Bspl20I 


Gggccc 


10 


11 


CHI 




Apal 


GGGCCc 


10 


11 


CHI 




PspOOMI 


Gggccc 


10 


11 




25 


BciVI 


GT AT C CNNNNNN 


11 


11 






Sail 


Gtcgac 


11 


12 






DrdI 


GACNNNNnngtc 


12 


12 






KasI 


Ggcgcc 


12 


12 






Xmal 


Cccggg 


12 


1 A 

14 




30 


Bglll 


Agatct 


14 


1 A 

14 






Hindi 


GTYrac 


16 


1 o 

lo 






BamHI 


Ggatcc 


1 / 


1 7 
1 / 






PflMI 


CCANNNNntgg 


17 


18 






BsmBI 


Nnnnnngagacg 


18 


Z 1 




35 


BstXI 


CCANNNNNntgg 


18 


19 


HC FR2 




XmnI 


GAANNnnttc 


18 


18 






SacII 


CCGCgg 


19 


19 






PstI 


CTGCAg 


20 


24 






PvuII 


CAGctg 


20 


22 




40 


Aval 


Cycgrg 


21 


24 






EagI 


Cggccg 


21 


22 






Aatll 


GACGTc 


22 


22 






BspMI 


ACCTGC 


27 








AccI 


GTmkac 


30 


43 




45 


Styl 


Ccwwgg 


o c 

6b 


A O 

4y 






AlwNI 


CAGNNNctg 




H4 






Bsal 


GGTCTCNnnnn 


38 


44 






PpuMI 


RGgwccy 


43 


46 






Bsgl 


GTGCAG 


44 


54 




50 


BseRI 


NNnnnnnnnnctcctc 


48 


60 






Ecil 


nnnnnnnnntccgcc 


52 


57 


HC Fr4, 47/79 have one 




BstEII 


Ggtnacc 


54 


61 




EcoO109I 


RGgnccy 


54 


86 






Bpml 


ctccag 


60 


121 




55 


Avail 


Ggwcc 


71 


140 
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Table 21: MALI A3 , annotated 

! MALI A3 9532 bases 

i 

1 aat get act act 
! gene ii continued 
49 cca aat gaa aat 



289 



385 
433 



RBS? . 



721 



832 ggtaattca ca 

I 

! Ml 

843 ATG att aaa gtt 
! Start gene V 

! S17 S20 

891 tct ggt gtt tct 

! 

! V35 

939 tgt tac gtt gat 

I 

! D50 

987 ctt gat gaa ggt 

i 

i L65 



Start gene VII 



att 


agt 


aga 


att 


gat 


gee 


acc 


ttt 


tea 


get 


cgc 


gee 


ata 


get 


aaa 


cag 


gtt 


att 


gac 


cat 


ttg 


cga 


aat 


gta 


act 


aaa 


tct 


act 


cgt 


teg 


cag 


aat 


tgg 


gaa 


tea 


act 


gaa 


act 


tec 


aga 


cac 


cgt 


act 


tta 


gtt 


gca 


tat 


tta 


eta 


cag 


cac 


cag 


att 


cag 


caa 


tta 


age 


tct 


aag 


cca 


acc 


tct 


tat 


caa 


aag 


gag 


caa 


tta 


aag 


gta 


etc 


tct 


ttg 


gag 


ttt 


get 


tec ggt 


ctg 


gtt 


cgc 


ttt 


gaa 


get 


cga 


tat 


ttg 


aag 


tct 


ttc 


ggg 


ctt 


cct 


ctt 


aat 




cgc 


ttt 


get 


tct 


gac 


tat 


aat 


agt 


cag ggt 


aaa 


gac 


tta 


tgg 


tea 


ttc 


teg 


ttt 


tct 


gaa 


ctg 


ttt 


aaa 




tea 


ATG 


aat 


att 


tat 


gac 


gat 


tec 


gca 


gta 


ttg 


gac 




Start gene x, ii continues 








tct 


aaa 


cat 


ttt 


act 


att 


acc 


ccc 


tct 


ggc 


aaa 


act 


tct 


cgc 


tat 


ttt 


ggt 


ttt 


tat 


cgt 


cgt 


ctg 


g La 


aac 


agt 


gtt 


get 


ctt 


act 


atg 


cct 


cgt 


aat 


tec 


ttt 


tgg 


gca 


tta 


gtt 


gaa 


tgt 


ggt 


att 


cct 


aaa 


tct 


caa 


ctg 


acc 


tgt 


aat 


aat 


gtt 


gtt 


ccg 


tta 


gtt 


cgt 






tct 


tec 


caa 


cgt 


cct 


gac 


tgg 


tat 


aat 


gag 


cca 


gtt 


TAA 
























End 


X & 


II 




















E5 










Q10 










1 1 J 




gaa 


att 


aaa 


cca 


tct 


caa 


gee 


caa 


ttt 


act 


act 


cgt 










P25 










E30 






cgt 


cag 


ggc 


aag 


cct 


tat 


tea 


ctg 


aat 


gag 


cag 


ctt 








E40 










V45 








ttg 


ggt 


aat 


gaa 


tat 


ccg 


gtt 


ctt 


gtc 


aag 


att 


act 






A55 










L60 










cag 


cca 


gee 


tat 


gcg 


cct 


ggt 


cTG 


TAC 


Acc 


gtt 


cat 












BsrGI. 










V70 










S75 










R80 


aaa 


gtt ggt 


cag 


ttc 


ggt 


tec 


ctt 


atg 


att 


gac 


cgt 


P85 




K87 


end 


of 


V 














ccg 


get 


aag 


TAA 


C 
















gcg 


gat 


ttc 


gac 


aca 


att 


tat 


cag 


gcg 


atg 






, gtt 


gta 


ctt 


tgt 


ttc 


gcg 


ctt 


ggt 


ata 


ate 




VII and IX overlap. 


















S2 


V3 


L4 


V5 










S10 




i agA TGA gt 


gtt 


tta 


gtg 


tat 


tct 


ttc 


gee 


tct 


ttc gtt 



End VII 
I start IX 

L13 WIS G20 T25 E29 

1242 tta ggt tgg tgc ctt cgt agt ggc att acg tat ttt acc cgt tta atg gaa 
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1293 act tec tc 



10 



15 



20 



25 



30 



35 



40 



45 



.... stop of IX, IX and VIII overlap by four bases 
1301 ATG aaa aag tct tta gtc ctc aaa gcc tct gta gcc gtt gct acc ctc 
Start signal sequence of viii. 

1349 gtt ccg atg ctg tct ttc gct gct gag ggt gac gat ccc gca aaa gcg 

mature VIII > 

1397 gcc ttt aac tcc ctg caa gcc tca gcg acc gaa tat atc ggt tat gcg 
1445 tgg gcg atg gtt gtt gtc att 

1466 gtc ggc gca act atc ggt atc aag ctg ttt aag 
1499 aaa ttc acc tcg aaa gca ! 1515 



-35 



1517 



age tga taaaccgat acaattaaag gctccttttg 
-10 



1552 gagecttttt ttttGGAGAt ttt ! S.D. underlined 



< III signal sequence 

MKKLLFAIPLV 
1575 caac GTG aaa aaa tta tta ttc gca att cct tta gtt 

VPFYSHSAQ 
1612 gtt cct ttc tat tct cac aGT gcA Cag tCT 

ApaLI . . . 



1611 



1642 



1729 
1777 
1825 
1870 
1900 
1930 

1969 
2002 
2050 
2098 
2146 
2194 
2242 
2290 



GTC GTG ACG CAG CCG CCC TCA GTG TCT GGG GCC CCA GGG CAG 
AGG GTC ACC ATC TCC TGC ACT GGG AGC AGC TCC AAC ATC GGG GCA 
BstEII. . . 

GGT TAT GAT GTA CAC TGG TAC CAG CAG CTT CCA GGA ACA GCC CCC AAA 
CTC CTC ATC TAT GGT AAC AGC AAT CGG CCC TCA GGG GTC CCT GAC CGA 
TTC TCT GGC TCC AAG TCT GGC ACC TCA GCC TCC CTG GCC ATC ACT 
GGG CTC CAG GCT GAG GAT GAG GCT GAT TAT 
TAC TGC CAG TCC TAT GAC AGC AGC CTG AGT 
GGC CTT TAT GTC TTC GGA ACT GGG ACC AAG GTC ACC GTC 

BstEII. . . 

CTA GGT CAG CCC AAG GCC AAC CCC ACT GTC ACT 

CTG TTC CCG CCC TCC TCT GAG GAG CTC CAA GCC AAC AAG GCC ACA CTA 
GTG TGT CTG ATC AGT GAC TTC TAC CCG GGA GCT GTG ACA GTG GCC TGG 
AAG GCA GAT AGC AGC CCC GTC AAG GCG GGA GTG GAG ACC ACC ACA CCC 
TCC AAA CAA AGC AAC AAC AAG TAC GCG GCC AGC AGC TAT CTG AGC CTG 
ACG CCT GAG CAG TGG AAG TCC CAC AGA AGC TAC AGC TGC CAG GTC ACG 
CAT GAA GGG AGC ACC GTG GAG AAG ACA GTG GCC CCT ACA GAA TGT TCA 
TAA TAA ACCG CCTCCACCG G GCGCGCCA AT TCTATTTCAA GGAGACAGTC ATA 

AscI 



50 



2343 



PelB signal > 

MKYLLPTAAAGLLLL 
ATG AAA TAC CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC 



16 17 18 19 20 
A A Q P A 
55 2388 gcG GCC cag ccG G CC 

Sfil.. 

NgoMI . . . (1/2) 
Ncol . . 



21 22 
M A 
atg g ec 
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2409 



FRl (DP47/V3-23) 

23 24 25 26 27 28 29 30 
EVQLLESG 
gaa | gtt 1 CAA j TTG i tt a | gag ( t ct [ ggt i 
I Mfel 1 



On 



I ' 



10 



15 



20 



FR1 

31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 
GGLVQPGGSLRLSCA 
2433 | ggc | ggt ) ctt j gtt | cag | cct | ggt | ggt | tct 1 tta ! cgt | ctt I tct | tgc | get i 

FRl >j . - .CDRl | FR2 

46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 
AS GFTFS S YAMSWVR 
2478 | get | TCC | GGA| ttc | act | ttc | tct | tCG | TAC | Get j atg | tct | tgg | gtt | cgC | 

| BspEI | | BsiWII (BstXI. 

FR2 >| . . . CDR2 

61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 
QAPGKGLEWVSAI SG 
2523 | CAa | get j ccT | GGt | aaa | ggt | ttg | gag | tgg | gtt | tct | get | ate | tct j ggt | 
...BstXI 1 



30 



35 



40 



45 



CDR2 | FR3 

76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 
SGGSTYYADSVKGRF 
2568 1 tct j ggt | ggc | agt | act j tac | tat | get 1 gac | tec j gtt | aaa ! ggt | cgc | ttc ! 



FR3 

91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 
TISRDNSKNTLYLQM 
2613 | act | ate | TCT | AGA| gac { aac 1 tct 1 aag | aat | act | etc | tac J ttg | cag j atg } 
I Xbal I 

FR3 >| 

106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 
NSLRAEDTAVYYCAK 
2658 S aac | agC | TTA 1 AGg ! get | gag | gac | aCT | GCA| Gtc | tac 1 tat | tgc | get | aaa | 
lAflll i 1 PstI J 

CDR3 | FR4 

121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 
DYEGTGYAFDIW.GQG 
2703 | gac | tat j gaa | ggt | act | ggt | tat 1 get j ttc | gaC j ATAj TGg I ggt | caa I ggt | 

I Ndel | (1/4) 



FR4 > j 

50 ! 136 137 138 139 140 141 142 

T M V T V S S 
274 8 1 act j atG | GTC | ACC j gt c j tct j agt 
I BstEII | 

From BstEII onwards, pV323 is same as pCESl, except as noted. 
55 ! BstEII sites may occur in light chains; not likely to be unique in final 
vector. 



143 144 145 146 147 148 149 150 151 152 
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10 



50 



55 



2769 



ASTKGPSVFP 
gcc tcc acc aaG GGC CCa tcg GTC TTC ccc 
Bsp120I. BbsI. . . (2/2) 

ApaI .... 



153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 
LAPSSKSTSGGTAAL 
2799 ctg gca ccC TCC TCc aag agc acc tct ggg ggc aca gcg gcc ctg 
BseRI. . . (2/2) 









168 


169 


170 


171 


172 


173 


174 


175 


176 


177 


178 


179 


180 


181 


182 






j 


G 


C 


L 


V 


K 


D 


Y 


F 


P 


E 


P 


V 


T 


V 


S 






2844 


ggc 


tgc 


ctg 


GTC 


AAG 


GAC 


TAC 


TTC 


CCc 


gaA CCG 


GTg 


acg 


gtg 


teg 


























Agel . 












15 


j 




































j 


183 


184 


185 


186 


187 


188 


189 


190 


191 


192 


193 


194 


195 


196 


197 






; 


W 


N 


S 


G 


A 


L 


T 


S 


G 


V 


H 


T 


F 


P 


A 






2889 


tgg 


aac 


tea 


GGC 


GCC 


ctg 


acc 


age 


ggc 


gtc 


cac 


acc 


ttc 


ccg 


get 














Kasl. . . 


(1/4) 






















20 


; 




































j 


198 


199 


200 


201 


202 


203 


204 


205 


206 


207 


208 


209 


210 


211 


212 








V 


L 


Q 


S 


S 


G 


L 


Y 


S 


L 


S 


S 


V 


V 


T 






2934 


gtc 


eta 


cag 


tct 


age 


GGa 


etc 


tac 


tec 


etc 


age 


age 


gta 


gtg 


acc 














(Bsu36l. . . ) (knocked 


out) 
















25 


j 






































213 


214 


215 


216 


217 


218 


219 


220 


221 


222 


223 


224 


225 


226 


227 








V 


P , 


S 


S 


S 


L 


G 


T 


Q 


T 


Y 


I 


C 


N 


V 






2979 


gtg 


ccC 


tct 


tct 


age 


tTG Ggc 


acc 


cag 


acc 


tac 


ate 


tgc 


aac 


gtg 












(BstXI. . 






■ )N. 


B. destruction of BstXI 


& Bpml £ 


% 


30 








































228 


229 


230 


231 


232 


233 


234 


235 


236 


237 


238 


239 


240 


241 


242 








N 


H 


K 


P 


S 


N 


T 


K 


V 


D 


K 


K 


V 


E 


P 


w 




3024 


aat 


cac 


aag 


ccc 


age 


aac 


acc 


aag 


gtg gac 


aag 


aaa 


gtt 


gag 


ccc 


£-5*5 
^ ; 


35 




243 


244 


245 


























t : 






K 


S 


C 


A 


A 


A 


H 


H 


H 


H 


H 


H 


S 


A 




«■ 




3069 


aaa 


tct 


tgt 


GCG 


GCC 


GCt 


cat 


cac 


cac 


cat 


cat 


cac 


tct 


get 




N ! 












Notl. . . 
























40 




E 


Q 


K 


L 


I 


S 


E 


E 


D 


L 


N 


G 


A 


A 








3111 


gaa 


caa 


aaa 


etc 


ate 


tea 


gaa 


gag 


gat 


ctg aat 


ggt 


gcc 


gca 










D 


I 


N 


D 


D 


R 


M 


A S 


G A 










45 


3153 


GAT 


ATC 


aac 


gat 


gat 


cgt 


atg 


get AGC ggc gcc 









rEK cleavage site. 
EcoRV. . 



Nhel. 



Kasl. . . 



Domain 1 

AETVESCLA 
3183 get gaa act gtt gaa agt tgt tta gca 



KPHTENSF 
3210 aaa ccc cat aca gaa aat tca ttt 

TNVWKDDKT 
3234 aCT AAC GTC TGG AAA GAC GAC AAA ACt 
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LDRYANYEGCLWNATGV 
3261 tta gat cgt tac gct aac tat gag ggt tgt ctg tgG AAT GCt aca ggc gtt 

BsmI 

VVCTGDETQCYGTWVP I 
3312 gta gtt tgt act ggt GAC GAA ACT CAG TGT TAC GGT ACA TGG GTT cct att 



10 



15 



G L A I PEN 
3363 ggg ctt get ate cct gaa aat 

LI linker 

EGGGSEGGGS 
3384 gag ggt ggt ggc tct gag ggt ggc ggt tct 

EGGGSEGGGT 
3414 gag ggt ggc ggt tct gag ggt ggc ggt act 



W 



! Domain 2 



TO? 1 



20 



25 



30 



35 



40 



aaa 


cct 


cct 


gag 


tac ggt 


gat 


aca 


cct 


att 


ccg 


ggc 


tat 


act 


tat 


ate 


aac 


cct 


etc 


gac 


ggc 


act tat 


ccg 


cct 


ggt 


act 


gag 


caa 


aac 


ccc 


get 


aat 


cct 


aat 


cct 


tct 


ctt 


GAG GAG 
BseRI 


tct 


cag 


cct 


ctt 


aat 


act 


ttc 


atg 


ttt 


cag 


aat 


aat 


agg 


ttc 


cga 


aat agg 


cag 


ggg 


gca 


tta 


act 


gtt 


tat 


acg 


ggc 


act 




gtt 


act 


caa 


ggc 


act gac 


ccc 


gtt 


aaa 


act 


tat 


tac 


cag 


tac 


act 


cct 




gta 


tea 


tea 


aaa 


gec atg 


tat 


gac 


get 


tac tgg 


aac 


ggt 


aaa 


ttc 


AGA 






























AlwNI 




GAC 


TGc 


get 


ttc 


cat tct 


ggc 


ttt 


aat 


gaa 


gat 


cca 


ttc 


gtt 


tgt 


gaa 




AlwNI 






























tat 


caa 


ggc 


caa 


teg tct 


gac 


ctg 


cct 


caa 


cct 


cct 


gtc 


aat 


get 







3834 ggc ggc ggc tct 

start L2 

3846 ggt ggt ggt tct 
3858 ggt ggc ggc tct 

3870 gag ggt ggt ggc tct gag ggt ggc ggt tct 
3900 gag ggt ggc ggc tct gag gga ggc ggt tec 
3930 ggt ggt ggc tct ggt ! end L2 



Domain 3 







S 


G 


D 


F 


D 


Y 


E 


K 


M 


A 


N 


A 


N 


K 


G 


A 




3945 


tec 


ggt 


gat 


ttt 


gat 


tat 


gaa 


aag 


atg 


gca 


aac 


get 


aat 


aag 


ggg 


get 






M 


T 


E 


N 


A 


D 


E 


N 


A 


L 


Q 


S 


D 


A 


K 


G 


45 


3993 


atg 


acc 


gaa 


aat 


gec 


gat 


gaa 


aac 


gcg 


eta 


cag 


tct 


gac 


get 


aaa 


ggc 






K 


L 


D 


S 


V 


A 


T 


D 


Y 


G 


A 


A 


I 


D 


G 


F 




4041 


aaa 


ctt 


gat 


tct 


gtc 


get 


act 


gat 


tac 


ggt 


get 


get 


ate 


gat 


ggt 


ttc 


50 




I 


G 


D 


V 


S 


G 


L 


A 


N 


G 


N 


G 


A 


T 


G 


D 




4089 


att 


ggt 


gac 


gtt 


tec 


ggc 


ctt 


get 


aat 


ggt 


aat 


ggt 


get 


act 


ggt 


gat 






F 


A 


G 


S 


N 


S 


Q 


M 


A 


Q 


V 


G 


D 


G 


D 


N 




4137 


ttt 


get 


ggc 


tct 


aat 


tec 


caa 


atg 


get 


caa 


gtc 


ggt 


gac 


ggt 


gat 


aat 


55 




























S 


P 


L 


M 


N 


N 


F 


R 


Q 


Y 


L 


P 


S 


L 


P 


Q 




4185 


tea 


cct 


tta 


atg 


aat 


aat 


ttc 


cgt 


caa 


tat 


tta 


cct 


tec 


etc 


cct 


caa 
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SVECRPFVFSAGKPYE 
4233 tcg gtt gaa tgt cgc cct ttt gtc ttt agc gct ggt aaa cca tat gaa 

FSIDCDKINLFR 
4281 ttt tct att gat tgt gac aaa ata aac tta ttc cgt 

End Domain 3 





G V F A F 


L 


L 


Y 


V 


A 


T 


H 1 


M 


y 


V 


F14I 


4317 


ggt gtc ttt gcg ttt 


ctt 


tta 


tat 


gtt 


gee 


ace 


ttt 


atg 


tat 


ata 


ttt 




start transmembrane 


segment 






















S T F A N 


I 


L 




















4365 


tct acg ttt get aac 


ata 


ctg 






















R N K E S 
























4386 


cgt aat aag gag tct 


TAA 


! stop of iii 














Intracellular anchor, 


























Ml P2 V L 


L5 


G 


I 


P 


L 


L10 


L 


R 


F 


L 


G15 


4404 


tc ATG cca gtt ctt ttg 


ggt 


att 


ccg 


tta 


tta 


ttg 


cgt 


ttc 


etc 


ggt 




Start VI 
























4451 


ttc ctt ctg gta act 


ttg 


ttc 


ggc 


tat 


ctg 


ctt 


act 


ttt 


ctt 


aaa 


aag 


4499 


ggc ttc ggt aag ata 


get 


att 


get 


att 


tea 


ttg 


ttt 


ctt 


get 


ctt 


att 


4547 


att ggg ctt aac tea 


att 


ctt 


gtg 


ggt 


tat 


etc 


tct 


gat 


att 


age 


get 


4595 


caa tta ccc tct gac 


ttt 


gtt 


cag 


ggt 


gtt 


cag 


tta 


att 


etc 


ccg 


tct 


4643 


aat gcg ctt ccc tgt 


ttt 


tat 


gtt 


att 


etc 


tct 


gta 


aag 


get 


get 


att 


4691 


ttc att ttt gac gtt 


aaa 


caa 


aaa 


ate 


gtt tct 


tat 


ttg 


gat 


tgg 


gat 




Ml A2 V3 


F5 








L10 




G13 



4739 aaa TAA t ATG get gtt tat ttt gta act ggc aaa tta ggc tct gga 
end VI Start gene I 





14 


15 


16 


17 


18 


19 


20 


21 


22 


23 


24 


25 


26 


27 


28 




K 


T 


L 


V 


S 


V 


G 


K 


I 


Q 


D 


K 


I 


V 


A 


4785 


aag 


acg 


etc 


gtt 


age 


gtt 


ggt 


aag 


att 


cag 


gat 


aaa 


att 


gta 


get 




29 


30 


31 


32 


33 


34 


35 


36 


37 


38 


39 


40 


41 


42 


43 




G 


C 


K 


I 


A 


T 


N 


L 


D 


L 


R 


L 


Q 


N 


L 


4830 


ggg 


tgc 


aaa 


ata 


gca 


act 


aat 


ctt 


gat 


tta 


agg 


ctt 


caa 


aac 


etc 




44 


45 


46 


47 


48 


49 


50 


51 


52 


53 


54 


55 


56 


57 


58 




P 


Q 


V 


G 


R 


F 


A 


K 


T 


P 


R 


V 


L 


R 


I 


4875 


ccg 


caa 


gtc 


ggg 


agg 


ttc 


get 


aaa 


acg 


cct 


cgc 


gtt 


ctt 


aga 


ata 




59 


60 


61 


62 


63 


64 


65 


66 


67 


68 


69 


70 


71 


72 


73 




P 


D 


K 


p 


S 


I 


S 


D 


L 


L 


A 


I 


G 


R 


G 


4920 


ccg 


gat 


aag 


cct 


tct 


ata 


tct 


gat 


ttg 


ctt 


get 


att 


ggg 


cgc 


ggt 




74 


75 


76 


77 


78 


79 


80 


81 


82 


83 


84 


85 


86 


87 


88 




N 


D 


S 


Y 


D 


E 


N 


K 


N 


G 


L 


L 


V 


L 


D 


4965 


aat 


gat 


tec 


tac 


gat 


gaa 


aat 


aaa 


aac 


ggc 


ttg 


ctt 


gtt 


etc 


gat 




89 


90 


91 


92 


93 


94 


95 


96 


97 


98 


99 


100 


101 


102 


103 




E 


C 


G 


T 


W 


F 


N 


T 


R 


S 


W 


N 


D 


K 


E 


5010 


gag 


tgc 


ggt 


act 


tgg 


ttt 


aat 


ace 


cgt 


tct 


tgg 


aat 


gat 


aag 


gaa 



104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 
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J 


R 


Q 


P 


I 


I 


D 


W 


F 


L 


H 


A 


R 


K 


L 


G 






5055 

1 


aga 


cag 


ccg 


att 


att 


gat 


tgg 


ttt 


eta 


cat 


get 


cgt 


aaa 


tta 


gga 






! 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133
! W D I I F L V Q D L S I V D K
5100 tgg gat att att ttt ctt gtt cag gac tta tct att gtt gat aaa






! 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148
! Q A R S A L A E H V V Y C R R
5145 cag gcg cgt tct gca tta gct gaa cat gtt gtt tat tgt cgt cgt






! 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163
! L D R I T L P F V G T L Y S L
5190 ctg gac aga att act tta cct ttt gtc ggt act tta tat tct ctt




! 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178
! I T G S K M P L P K L H V G V
5235 att act ggc tcg aaa atg cct ctg cct aaa tta cat gtt ggc gtt




! 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193
! V K Y G D S Q L S P T V E R W
5280 gtt aaa tat ggc gat tct caa tta agc cct act gtt gag cgt tgg






! 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208
! L Y T G K N L Y N A Y D T K Q
5325 ctt tat act ggt aag aat ttg tat aac gca tat gat act aaa cag






! 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223
! A F S S N Y D S G V Y S Y L T
5370 gct ttt tct agt aat tat gat tcc ggt gtt tat tct tat tta acg






! 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238
! P Y L S H G R Y F K P L N L G
5415 cct tat tta tca cac ggt cgg tat ttc aaa cca tta aat tta ggt


! 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253
! Q K M K L T K I Y L K K F S R
5460 cag aag atg aaa tta act aaa ata tat ttg aaa aag ttt tct cgc




! 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268
! V L C L A I G F A S A F T Y S
5505 gtt ctt tgt ctt gcg att gga ttt gca tca gca ttt aca tat agt






! 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283
! Y I T Q P K P E V K K V V S Q
5550 tat ata acc caa cct aag ccg gag gtt aaa aag gta gtc tct cag






! 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298
! T Y D F D K F T I D S S Q R L
5595 acc tat gat ttt gat aaa ttc act att gac tct tct cag cgt ctt






! 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313
! N L S Y R Y V F K D S K G K L
5640 aat cta agc tat cgc tat gtt ttc aag gat tct aag gga aaa TTA
PacI






! 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328
! I N S D D L Q K Q G Y S L T Y
5685 ATT AAt agc gac gat tta cag aag caa ggt tat tca ctc aca tat
PacI

! 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343
! I D L C T V S I K K G N S N E
! iv M1 K
5730 att gat tta tgt act gtt tcc att aaa aaa ggt aat tca aAT Gaa
Start IV






! 344 345 346 347 348 349
! I V K C N . End of I
! iv L3 L N5 V I7 N F V10
5775 att gtt aaa tgt aat TAA T TTT GTT

IV continued

5800 ttc ttg atg ttt gtt tca tca tct tct ttt gct cag gta att gaa atg
5848 aat aat tcg cct ctg cgc gat ttt gta act tgg tat tca aag caa tca
5896 ggc gaa tcc gtt att gtt tct ccc gat gta aaa ggt act gtt act gta
5944 tat tca tct gac gtt aaa cct gaa aat cta cgc aat ttc ttt att tct
5992 gtt tta cgt gct aat aat ttt gat atg gtt ggt tca att cct tcc ata
6040 att cag aag tat aat cca aac aat cag gat tat att gat gaa ttg cca
6088 tca tct gat aat cag gaa tat gat gat aat tcc gct cct tct ggt ggt
6136 ttc ttt gtt ccg caa aat gat aat gtt act caa act ttt aaa att aat
6184 aac gtt cgg gca aag gat tta ata cga gtt gtc gaa ttg ttt gta aag
6232 tct aat act tct aaa tcc tca aat gta tta tct att gac ggc tct aat
6280 cta tta gtt gtt TCT gca cct aaa gat att tta gat aac ctt cct caa
ApaLI removed
6328 ttc ctt tct act gtt gat ttg cca act gac cag ata ttg att gag ggt
6376 ttg ata ttt gag gtt cag caa ggt gat gct tta gat ttt tca ttt gct
6424 gct ggc tct cag cgt ggc act gtt gca ggc ggt gtt aat act gac cgc
6472 ctc acc tct gtt tta tct tct gct ggt ggt tcg ttc ggt att ttt aat
6520 ggc gat gtt tta ggg cta tca gtt cgc gca tta aag act aat agc cat
6568 tca aaa ata ttg tct gtg cca cgt att ctt acg ctt tca ggt cag aag
6616 ggt tct atc tct gtT GGC CAg aat gtc cct ttt att act ggt cgt gtg
MscI
6664 act ggt gaa tct gcc aat gta aat aat cca ttt cag acg att gag cgt
6712 caa aat gta ggt att tcc atg agc gtt ttt cct gtt gca atg gct ggc
6760 ggt aat att gtt ctg gat att acc agc aag gcc gat agt ttg agt tct
6808 tct act cag gca agt gat gtt att act aat caa aga agt att gct aca
6856 acg gtt aat ttg cgt gat gga cag act ctt tta ctc ggt ggc ctc act
6904 gat tat aaa aac act tct caa gat tct ggc gta ccg ttc ctg tct aaa
6952 atc cct tta atc ggc ctc ctg ttt agc tcc cgc tct gat tcc aac gag
7000 gaa agc acg tta tac gtg ctc gtc aaa gca acc ata gta cgc gcc ctg
7048 TAG cggcgcatt
End IV

7060 aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc
7120 gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcGCCGGCt ttccccgtca
NgoMI_
7180 agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc
7240 caaaaaactt gatttgggtg atggttCACG TAGTGggcca tcgccctgat agacggtttt
DraIII
7300 tcgccctttG ACGTTGGAGT Ccacgttctt taatagtgga ctcttgttcc aaactggaac
DrdI
7360 aacactcaac cctatctcgg gctattcttt tgatttataa gggattttgc cgatttcgga
7420 accaccatca aacaggattt tcgcctgctg gggcaaacca gcgtggaccg cttgctgcaa
7480 ctctctcagg gccaggcggt gaagggcaat CAGCTGttgc cCGTCTCact ggtgaaaaga
PvuII. BsmBI.
7540 aaaaccaccc tGGATCC AAGCTT
BamHI HindIII (1/2)
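
The site annotations in this listing can be cross-checked mechanically. The following is only a sketch: the helper name `find_sites` and the small `SITES` dictionary are illustrative assumptions, not part of the listing, and the recognition sequences used are the standard ones for BamHI (GGATCC) and HindIII (AAGCTT). It reports 1-based coordinates using the row label as the offset.

```python
# Hypothetical helper (not from the listing): locate recognition sites in one
# annotated row and report their coordinates in the vector numbering.
SITES = {"BamHI": "GGATCC", "HindIII": "AAGCTT"}  # standard recognition sequences


def find_sites(row_start, bases, sites=SITES):
    """Return {enzyme: [1-based start coordinates]} for each motif in `bases`.

    `row_start` is the coordinate label printed at the start of the row;
    spaces and letter case in `bases` are ignored.
    """
    seq = bases.replace(" ", "").upper()
    hits = {}
    for name, motif in sites.items():
        positions = []
        i = seq.find(motif)
        while i != -1:
            positions.append(row_start + i)
            i = seq.find(motif, i + 1)
        hits[name] = positions
    return hits


row_7540 = "aaaaccaccc tGGATCC AAGCTT"
print(find_sites(7540, row_7540))
```

Under these assumptions the BamHI site starts at 7551 and the HindIII site at 7557, which is consistent with the next row of the listing beginning at 7563.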
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Insert carrying bla gene 
7563 gcaggtg gcacttttcg gggaaatgtg cgcggaaccc 

7600 ctatttgttt atttttctaa atacattcaa atatGTATCC gctcatgaga caataaccct 

BciVI 

7660 gataaatgct tcaataatat tgaaaaAGGA AGAgt 

RBS.?...

Start bla gene

7695 ATG agt att caa cat ttc cgt gtc gcc ctt att ccc ttt ttt gcg gca ttt
7746 tgc ctt cct gtt ttt gct cac cca gaa acg ctg gtg aaa gta aaa gat gct
7797 gaa gat cag ttg ggC gCA CGA Gtg ggt tac atc gaa ctg gat ctc aac agc
BssSI...
ApaLI removed
7848 ggt aag atc ctt gag agt ttt cgc ccc gaa gaa cgt ttt cca atg atg agc
7899 act ttt aaa gtt ctg cta tgt cat aca cta tta tcc cgt att gac gcc ggg
7950 caa gaG CAA CTC GGT CGc cgg gcg cgg tat tct cag aat gac ttg gtt gAG
BcgI                                                            Sca..
8001 TAC Tca cca gtc aca gaa aag cat ctt acg gat ggc atg aca gta aga gaa
ScaI
8052 tta tgc agt gct gcc ata acc atg agt gat aac act gcg gcc aac tta ctt
8103 ctg aca aCG ATC Gga gga ccg aag gag cta acc gct ttt ttg cac aac atg
PvuI
8154 ggg gat cat gta act cgc ctt gat cgt tgg gaa ccg gag ctg aat gaa gcc
8205 ata cca aac gac gag cgt gac acc acg atg cct gta gca atg cca aca acg
8256 tTG CGC Aaa cta tta act ggc gaa cta ctt act cta gct tcc cgg caa caa
FspI
8307 tta ata gac tgg atg gag gcg gat aaa gtt gca gga cca ctt ctg cgc tcg
8358 GCC ctt ccG GCt ggc tgg ttt att gct gat aaa tct gga gcc ggt gag cgt
BglI
8409 gGG TCT Cgc ggt atc att gca gca ctg ggg cca gat ggt aag ccc tcc cgt
BsaI
8460 atc gta gtt atc tac acG ACg ggg aGT Cag gca act atg gat gaa cga aat
AhdI
8511 aga cag atc gct gag ata ggt gcc tca ctg att aag cat tgg TAA ctgt
stop

8560 cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 
8620 ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 
8680 cgttccactg tacgtaagac cccc
8704 AAGCTT GTCGAC tgaa tggcgaatgg cgctttgcct
HindIII SalI..
(2/2) HincII
8740 ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
8790 CCTGAGG
Bsu36I_
8797 ccgat actgtcgtcg tcccctcaaa ctggcagatg
8832 cacggttacg atgcgcccat ctacaccaac gtaacctatc ccattacggt caatccgccg
8892 tttgttccca cggagaatcc gacgggttgt tactcgctca catttaatgt tgatgaaagc
8952 tggctacagg aaggccagac gcgaattatt tttgatggcg ttcctattgg ttaaaaaatg
9012 agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaATTTAAA
SwaI...
9072 Tatttgctta tacaatcttc ctgtttttgg ggcttttctg attatcaacc GGGGTAcat
RBS?

9131 ATG att gac atg cta gtt tta cga tta ccg ttc atc gat tct ctt gtt tgc
Start gene II
9182 tcc aga ctc tca ggc aat gac ctg ata gcc ttt gtA GAT CTc tca aaa ata
BglII...
9233 gct acc ctc tcc ggc atg aat tta tca gct aga acg gtt gaa tat cat att
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9284 gat ggt gat ttg act 
     gtc tcc ggc ctt tct cac cct ttt gaa tct tta cct
9335 aca cat tac tca ggc att gca ttt aaa ata tat gag ggt tct aaa aat ttt
9386 tat cct tgc gtt gaa ata aag gct tct ccc gca aaa gta tta cag ggt cat
9437 aat gtt ttt ggt aca acc gat tta gct tta tgc tct gag gct tta ttg ctt
9488 aat ttt gct aat tct ttg cct tgc ctg tat gat tta ttg gat gtt ! 9532
gene II continues
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Table 2 IB: Sequence of MALI A3 r condensed 

LOCUS MALIA3 9532 CIRCULAR

ORIGIN

1 AATGCTACTA CTATTAGTAG AATTGATGCC ACCTTTTCAG CTCGCGCCCC AAATGAAAAT
61 ATAGCTAAAC AGGTTATTGA CCATTTGCGA AATGTATCTA ATGGTCAAAC TAAATCTACT
121 CGTTCGCAGA ATTGGGAATC AACTGTTACA TGGAATGAAA CTTCCAGACA CCGTACTTTA
181 GTTGCATATT TAAAACATGT TGAGCTACAG CACCAGATTC AGCAATTAAG CTCTAAGCCA
241 TCCGCAAAAA TGACCTCTTA TCAAAAGGAG CAATTAAAGG TACTCTCTAA TCCTGACCTG
361 TCTTTCGGGC TTCCTCTTAA TCTTTTTGAT GCAATCCGCT TTGCTTCTGA CTATAATAGT
421 CAGGGTAAAG ACCTGATTTT TGATTTATGG TCATTCTCGT TTTCTGAACT GTTTAAAGCA
481 TTTGAGGGGG ATTCAATGAA TATTTATGAC GATTCCGCAG TATTGGACGC TATCCAGTCT
541 AAACATTTTA CTATTACCCC CTCTGGCAAA ACTTCTTTTG CAAAAGCCTC TCGCTATTTT
601 GGTTTTTATC GTCGTCTGGT AAACGAGGGT TATGATAGTG TTGCTCTTAC TATGCCTCGT
661 AATTCCTTTT GGCGTTATGT ATCTGCATTA GTTGAATGTG GTATTCCTAA ATCTCAACTG
721 ATGAATCTTT CTACCTGTAA TAATGTTGTT CCGTTAGTTC GTTTTATTAA CGTAGATTTT
781 TCTTCCCAAC GTCCTGACTG GTATAATGAG CCAGTTCTTA AAATCGCATA AGGTAATTCA
841 CAATGATTAA AGTTGAAATT AAACCATCTC AAGCCCAATT TACTACTCGT TCTGGTGTTT
901 CTCGTCAGGG CAAGCCTTAT TCACTGAATG AGCAGCTTTG TTACGTTGAT TTGGGTAATG
961 AATATCCGGT TCTTGTCAAG ATTACTCTTG ATGAAGGTCA GCCAGCCTAT GCGCCTGGTC
1021 TGTACACCGT TCATCTGTCC TCTTTCAAAG TTGGTCAGTT CGGTTCCCTT ATGATTGACC
1081 GTCTGCGCCT CGTTCCGGCT AAGTAACATG GAGCAGGTCG CGGATTTCGA CACAATTTAT
1141 CAGGCGATGA TACAAATCTC CGTTGTACTT TGTTTCGCGC TTGGTATAAT CGCTGGGGGT
1201 CAAAGATGAG TGTTTTAGTG TATTCTTTCG CCTCTTTCGT TTTAGGTTGG TGCCTTCGTA
1261 GTGGCATTAC GTATTTTACC CGTTTAATGG AAACTTCCTC ATGAAAAAGT CTTTAGTCCT
1321 CAAAGCCTCT GTAGCCGTTG CTACCCTCGT TCCGATGCTG TCTTTCGCTG CTGAGGGTGA
1381 CGATCCCGCA AAAGCGGCCT TTAACTCCCT GCAAGCCTCA GCGACCGAAT ATATCGGTTA
1441 TGCGTGGGCG ATGGTTGTTG TCATTGTCGG CGCAACTATC GGTATCAAGC TGTTTAAGAA
1501 ATTCACCTCG AAAGCAAGCT GATAAACCGA TACAATTAAA GGCTCCTTTT GGAGCCTTTT
1561 TTTTTGGAGA TTTTCAACGT GAAAAAATTA TTATTCGCAA TTCCTTTAGT TGTTCCTTTC
1621 TATTCTCACA GTGCACAGTC TGTCGTGACG CAGCCGCCCT CAGTGTCTGG GGCCCCAGGG
1681 CAGAGGGTCA CCATCTCCTG CACTGGGAGC AGCTCCAACA TCGGGGCAGG TTATGATGTA
1741 CACTGGTACC AGCAGCTTCC AGGAACAGCC CCCAAACTCC TCATCTATGG TAACAGCAAT
1801 CGGCCCTCAG GGGTCCCTGA CCGATTCTCT GGCTCCAAGT CTGGCACCTC AGCCTCCCTG
1861 GCCATCACTG GGCTCCAGGC TGAGGATGAG GCTGATTATT ACTGCCAGTC CTATGACAGC
1921 AGCCTGAGTG GCCTTTATGT CTTCGGAACT GGGACCAAGG TCACCGTCCT AGGTCAGCCC
1981 AAGGCCAACC CCACTGTCAC TCTGTTCCCG CCCTCCTCTG AGGAGCTCCA AGCCAACAAG
2041 GCCACACTAG TGTGTCTGAT CAGTGACTTC TACCCGGGAG CTGTGACAGT GGCCTGGAAG
2101 GCAGATAGCA GCCCCGTCAA GGCGGGAGTG GAGACCACCA CACCCTCCAA ACAAAGCAAC
2161 AACAAGTACG CGGCCAGCAG CTATCTGAGC CTGACGCCTG AGCAGTGGAA GTCCCACAGA
2221 AGCTACAGCT GCCAGGTCAC GCATGAAGGG AGCACCGTGG AGAAGACAGT GGCCCCTACA
2281 GAATGTTCAT AATAAACCGC CTCCACCGGG CGCGCCAATT CTATTTCAAG GAGACAGTCA
2341 TAATGAAATA CCTATTGCCT ACGGCAGCCG CTGGATTGTT ATTACTCGCG GCCCAGCCGG
2401 CCATGGCCGA AGTTCAATTG TTAGAGTCTG GTGGCGGTCT TGTTCAGCCT GGTGGTTCTT
2461 TACGTCTTTC TTGCGCTGCT TCCGGATTCA CTTTCTCTTC GTACGCTATG TCTTGGGTTC
2521 GCCAAGCTCC TGGTAAAGGT TTGGAGTGGG TTTCTGCTAT CTCTGGTTCT GGTGGCAGTA
2581 CTTACTATGC TGACTCCGTT AAAGGTCGCT TCACTATCTC TAGAGACAAC TCTAAGAATA
2641 CTCTCTACTT GCAGATGAAC AGCTTAAGGG CTGAGGACAC TGCAGTCTAC TATTGCGCTA
2701 AAGACTATGA AGGTACTGGT TATGCTTTCG ACATATGGGG TCAAGGTACT ATGGTCACCG
2761 TCTCTAGTGC CTCCACCAAG GGCCCATCGG TCTTCCCCCT GGCACCCTCC TCCAAGAGCA
2821 CCTCTGGGGG CACAGCGGCC CTGGGCTGCC TGGTCAAGGA CTACTTCCCC GAACCGGTGA
2881 CGGTGTCGTG GAACTCAGGC GCCCTGACCA GCGGCGTCCA CACCTTCCCG GCTGTCCTAC
2941 AGTCTAGCGG ACTCTACTCC CTCAGCAGCG TAGTGACCGT GCCCTCTTCT AGCTTGGGCA
3001 CCCAGACCTA CATCTGCAAC GTGAATCACA AGCCCAGCAA CACCAAGGTG GACAAGAAAG
3061 TTGAGCCCAA ATCTTGTGCG GCCGCTCATC ACCACCATCA TCACTCTGCT GAACAAAAAC
3121 TCATCTCAGA AGAGGATCTG AATGGTGCCG CAGATATCAA CGATGATCGT ATGGCTGGCG
3181 CCGCTGAAAC TGTTGAAAGT TGTTTAGCAA AACCCCATAC AGAAAATTCA TTTACTAACG
3241 TCTGGAAAGA CGACAAAACT TTAGATCGTT ACGCTAACTA TGAGGGTTGT CTGTGGAATG
3241 TCTGGAAAGA CGACAAAACT TTAGATCGTT ACGCTAACTA TGAGGGTTGT CTGTGGAATG 
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3301 CTACAGGCGT TGTAGTTTGT ACTGGTGACG AAACTCAGTG TTACGGTACA TGGGTTCCTA 

3361 TTGGGCTTGC TATCCCTGAA AATGAGGGTG GTGGCTCTGA GGGTGGCGGT TCTGAGGGTG
3421 GCGGTTCTGA GGGTGGCGGT ACTAAACCTC CTGAGTACGG TGATACACCT ATTCCGGGCT
3481 ATACTTATAT CAACCCTCTC GACGGCACTT ATCCGCCTGG TACTGAGCAA AACCCCGCTA
3541 ATCCTAATCC TTCTCTTGAG GAGTCTCAGC CTCTTAATAC TTTCATGTTT CAGAATAATA
3601 GGTTCCGAAA TAGGCAGGGG GCATTAACTG TTTATACGGG CACTGTTACT CAAGGCACTG
3661 ACCCCGTTAA AACTTATTAC CAGTACACTC CTGTATCATC AAAAGCCATG TATGACGCTT
3721 ACTGGAACGG TAAATTCAGA GACTGCGCTT TCCATTCTGG CTTTAATGAA GATCCATTCG
3781 TTTGTGAATA TCAAGGCCAA TCGTCTGACC TGCCTCAACC TCCTGTCAAT GCTGGCGGCG
3841 GCTCTGGTGG TGGTTCTGGT GGCGGCTCTG AGGGTGGTGG CTCTGAGGGT GGCGGTTCTG
3901 AGGGTGGCGG CTCTGAGGGA GGCGGTTCCG GTGGTGGCTC TGGTTCCGGT GATTTTGATT
3961 ATGAAAAGAT GGCAAACGCT AATAAGGGGG CTATGACCGA AAATGCCGAT GAAAACGCGC
4021 TACAGTCTGA CGCTAAAGGC AAACTTGATT CTGTCGCTAC TGATTACGGT GCTGCTATCG
4081 ATGGTTTCAT TGGTGACGTT TCCGGCCTTG CTAATGGTAA TGGTGCTACT GGTGATTTTG
4141 CTGGCTCTAA TTCCCAAATG GCTCAAGTCG GTGACGGTGA TAATTCACCT TTAATGAATA
4201 ATTTCCGTCA ATATTTACCT TCCCTCCCTC AATCGGTTGA ATGTCGCCCT TTTGTCTTTA
4261 GCGCTGGTAA ACCATATGAA TTTTCTATTG ATTGTGACAA AATAAACTTA TTCCGTGGTG
4321 TCTTTGCGTT TCTTTTATAT GTTGCCACCT TTATGTATGT ATTTTCTACG TTTGCTAACA
4381 TACTGCGTAA TAAGGAGTCT TAATCATGCC AGTTCTTTTG GGTATTCCGT TATTATTGCG
4441 TTTCCTCGGT TTCCTTCTGG TAACTTTGTT CGGCTATCTG CTTACTTTTC TTAAAAAGGG
4501 CTTCGGTAAG ATAGCTATTG CTATTTCATT GTTTCTTGCT CTTATTATTG GGCTTAACTC
4561 AATTCTTGTG GGTTATCTCT CTGATATTAG CGCTCAATTA CCCTCTGACT TTGTTCAGGG
4621 TGTTCAGTTA ATTCTCCCGT CTAATGCGCT TCCCTGTTTT TATGTTATTC TCTCTGTAAA
4681 GGCTGCTATT TTCATTTTTG ACGTTAAACA AAAAATCGTT TCTTATTTGG ATTGGGATAA
4741 ATAATATGGC TGTTTATTTT GTAACTGGCA AATTAGGCTC TGGAAAGACG CTCGTTAGCG
4801 TTGGTAAGAT TCAGGATAAA ATTGTAGCTG GGTGCAAAAT AGCAACTAAT CTTGATTTAA
4861 GGCTTCAAAA CCTCCCGCAA GTCGGGAGGT TCGCTAAAAC GCCTCGCGTT CTTAGAATAC
4921 CGGATAAGCC TTCTATATCT GATTTGCTTG CTATTGGGCG CGGTAATGAT TCCTACGATG
4981 AAAATAAAAA CGGCTTGCTT GTTCTCGATG AGTGCGGTAC TTGGTTTAAT ACCCGTTCTT
5041 GGAATGATAA GGAAAGACAG CCGATTATTG ATTGGTTTCT ACATGCTCGT AAATTAGGAT
5101 GGGATATTAT TTTTCTTGTT CAGGACTTAT CTATTGTTGA TAAACAGGCG CGTTCTGCAT
5161 TAGCTGAACA TGTTGTTTAT TGTCGTCGTC TGGACAGAAT TACTTTACCT TTTGTCGGTA
5221 CTTTATATTC TCTTATTACT GGCTCGAAAA TGCCTCTGCC TAAATTACAT GTTGGCGTTG
5281 TTAAATATGG CGATTCTCAA TTAAGCCCTA CTGTTGAGCG TTGGCTTTAT ACTGGTAAGA
5341 ATTTGTATAA CGCATATGAT ACTAAACAGG CTTTTTCTAG TAATTATGAT TCCGGTGTTT
5401 ATTCTTATTT AACGCCTTAT TTATCACACG GTCGGTATTT CAAACCATTA AATTTAGGTC
5461 AGAAGATGAA ATTAACTAAA ATATATTTGA AAAAGTTTTC TCGCGTTCTT TGTCTTGCGA
5521 TTGGATTTGC ATCAGCATTT ACATATAGTT ATATAACCCA ACCTAAGCCG GAGGTTAAAA
5581 AGGTAGTCTC TCAGACCTAT GATTTTGATA AATTCACTAT TGACTCTTCT CAGCGTCTTA
5641 ATCTAAGCTA TCGCTATGTT TTCAAGGATT CTAAGGGAAA ATTAATTAAT AGCGACGATT
5701 TACAGAAGCA AGGTTATTCA CTCACATATA TTGATTTATG TACTGTTTCC ATTAAAAAAG
5761 GTAATTCAAA TGAAATTGTT AAATGTAATT AATTTTGTTT TCTTGATGTT TGTTTCATCA
5821 TCTTCTTTTG CTCAGGTAAT TGAAATGAAT AATTCGCCTC TGCGCGATTT TGTAACTTGG
5881 TATTCAAAGC AATCAGGCGA ATCCGTTATT GTTTCTCCCG ATGTAAAAGG TACTGTTACT
5941 GTATATTCAT CTGACGTTAA ACCTGAAAAT CTACGCAATT TCTTTATTTC TGTTTTACGT
6001 GCTAATAATT TTGATATGGT TGGTTCAATT CCTTCCATAA TTCAGAAGTA TAATCCAAAC
6061 AATCAGGATT ATATTGATGA ATTGCCATCA TCTGATAATC AGGAATATGA TGATAATTCC
6121 GCTCCTTCTG GTGGTTTCTT TGTTCCGCAA AATGATAATG TTACTCAAAC TTTTAAAATT
6181 AATAACGTTC GGGCAAAGGA TTTAATACGA GTTGTCGAAT TGTTTGTAAA GTCTAATACT
6241 TCTAAATCCT CAAATGTATT ATCTATTGAC GGCTCTAATC TATTAGTTGT TTCTGCACCT
6301 AAAGATATTT TAGATAACCT TCCTCAATTC CTTTCTACTG TTGATTTGCC AACTGACCAG
6361 ATATTGATTG AGGGTTTGAT ATTTGAGGTT CAGCAAGGTG ATGCTTTAGA TTTTTCATTT
6421 GCTGCTGGCT CTCAGCGTGG CACTGTTGCA GGCGGTGTTA ATACTGACCG CCTCACCTCT
6481 GTTTTATCTT CTGCTGGTGG TTCGTTCGGT ATTTTTAATG GCGATGTTTT AGGGCTATCA
6541 GTTCGCGCAT TAAAGACTAA TAGCCATTCA AAAATATTGT CTGTGCCACG TATTCTTACG
6601 CTTTCAGGTC AGAAGGGTTC TATCTCTGTT GGCCAGAATG TCCCTTTTAT TACTGGTCGT
6661 GTGACTGGTG AATCTGCCAA TGTAAATAAT CCATTTCAGA CGATTGAGCG TCAAAATGTA
6721 GGTATTTCCA TGAGCGTTTT TCCTGTTGCA ATGGCTGGCG GTAATATTGT TCTGGATATT
6721 GGTATTTCCA TGAGCGTTTT TCCTGTTGCA ATGGCTGGCG GTAATATTGT TCTGGATATT 
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6781 ACCAGCAAGG CCGATAGTTT GAGTTCTTCT ACTCAGGCAA GTGATGTTAT TACTAATCAA 

6841 AGAAGTATTG CTACAACGGT TAATTTGCGT GATGGACAGA CTCTTTTACT CGGTGGCCTC
6901 ACTGATTATA AAAACACTTC TCAAGATTCT GGCGTACCGT TCCTGTCTAA AATCCCTTTA
6961 ATCGGCCTCC TGTTTAGCTC CCGCTCTGAT TCCAACGAGG AAAGCACGTT ATACGTGCTC
7021 GTCAAAGCAA CCATAGTACG CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT
7081 TACGCGCAGC GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT
7141 CCCTTCCTTT CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC GGGGGCTCCC
7201 TTTAGGGTTC CGATTTAGTG CTTTACGGCA CCTCGACCCC AAAAAACTTG ATTTGGGTGA
7261 TGGTTCACGT AGTGGGCCAT CGCCCTGATA GACGGTTTTT CGCCCTTTGA CGTTGGAGTC
7321 CACGTTCTTT AATAGTGGAC TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGG
7381 CTATTCTTTT GATTTATAAG GGATTTTGCC GATTTCGGAA CCACCATCAA ACAGGATTTT
7441 CGCCTGCTGG GGCAAACCAG CGTGGACCGC TTGCTGCAAC TCTCTCAGGG CCAGGCGGTG
7501 AAGGGCAATC AGCTGTTGCC CGTCTCACTG GTGAAAAGAA AAACCACCCT GGATCCAAGC
7561 TTGCAGGTGG CACTTTTCGG GGAAATGTGC GCGGAACCCC TATTTGTTTA TTTTTCTAAA
7621 TACATTCAAA TATGTATCCG CTCATGAGAC AATAACCCTG ATAAATGCTT CAATAATATT
7681 GAAAAAGGAA GAGTATGAGT ATTCAACATT TCCGTGTCGC CCTTATTCCC TTTTTTGCGG
7741 CATTTTGCCT TCCTGTTTTT GCTCACCCAG AAACGCTGGT GAAAGTAAAA GATGCTGAAG
7801 ATCAGTTGGG CGCACGAGTG GGTTACATCG AACTGGATCT CAACAGCGGT AAGATCCTTG
7861 AGAGTTTTCG CCCCGAAGAA CGTTTTCCAA TGATGAGCAC TTTTAAAGTT CTGCTATGTC
7921 ATACACTATT ATCCCGTATT GACGCCGGGC AAGAGCAACT CGGTCGCCGG GCGCGGTATT
7981 CTCAGAATGA CTTGGTTGAG TACTCACCAG TCACAGAAAA GCATCTTACG GATGGCATGA
8041 CAGTAAGAGA ATTATGCAGT GCTGCCATAA CCATGAGTGA TAACACTGCG GCCAACTTAC
8101 TTCTGACAAC GATCGGAGGA CCGAAGGAGC TAACCGCTTT TTTGCACAAC ATGGGGGATC
8161 ATGTAACTCG CCTTGATCGT TGGGAACCGG AGCTGAATGA AGCCATACCA AACGACGAGC
8221 GTGACACCAC GATGCCTGTA GCAATGCCAA CAACGTTGCG CAAACTATTA ACTGGCGAAC
8281 TACTTACTCT AGCTTCCCGG CAACAATTAA TAGACTGGAT GGAGGCGGAT AAAGTTGCAG
8341 GACCACTTCT GCGCTCGGCC CTTCCGGCTG GCTGGTTTAT TGCTGATAAA TCTGGAGCCG
8401 GTGAGCGTGG GTCTCGCGGT ATCATTGCAG CACTGGGGCC AGATGGTAAG CCCTCCCGTA
8461 TCGTAGTTAT CTACACGACG GGGAGTCAGG CAACTATGGA TGAACGAAAT AGACAGATCG
8521 CTGAGATAGG TGCCTCACTG ATTAAGCATT GGTAACTGTC AGACCAAGTT TACTCATATA
8581 TACTTTAGAT TGATTTAAAA CTTCATTTTT AATTTAAAAG GATCTAGGTG AAGATCCTTT
8641 TTGATAATCT CATGACCAAA ATCCCTTAAC GTGAGTTTTC GTTCCACTGT ACGTAAGACC
8701 CCCAAGCTTG TCGACTGAAT GGCGAATGGC GCTTTGCCTG GTTTCCGGCA CCAGAAGCGG
8761 TGCCGGAAAG CTGGCTGGAG TGCGATCTTC CTGAGGCCGA TACTGTCGTC GTCCCCTCAA
8821 ACTGGCAGAT GCACGGTTAC GATGCGCCCA TCTACACCAA CGTAACCTAT CCCATTACGG
8881 TCAATCCGCC GTTTGTTCCC ACGGAGAATC CGACGGGTTG TTACTCGCTC ACATTTAATG
8941 TTGATGAAAG CTGGCTACAG GAAGGCCAGA CGCGAATTAT TTTTGATGGC GTTCCTATTG
9001 GTTAAAAAAT GAGCTGATTT AACAAAAATT TAACGCGAAT TTTAACAAAA TATTAACGTT
9061 TACAATTTAA ATATTTGCTT ATACAATCTT CCTGTTTTTG GGGCTTTTCT GATTATCAAC
9121 CGGGGTACAT ATGATTGACA TGCTAGTTTT ACGATTACCG TTCATCGATT CTCTTGTTTG
9181 CTCCAGACTC TCAGGCAATG ACCTGATAGC CTTTGTAGAT CTCTCAAAAA TAGCTACCCT
9241 CTCCGGCATG AATTTATCAG CTAGAACGGT TGAATATCAT ATTGATGGTG ATTTGACTGT
9301 CTCCGGCCTT TCTCACCCTT TTGAATCTTT ACCTACACAT TACTCAGGCA TTGCATTTAA
9361 AATATATGAG GGTTCTAAAA ATTTTTATCC TTGCGTTGAA ATAAAGGCTT CTCCCGCAAA
9421 AGTATTACAG GGTCATAATG TTTTTGGTAC AACCGATTTA GCTTTATGCT CTGAGGCTTT
9481 ATTGCTTAAT TTTGCTAATT CTTTGCCTTG CCTGTATGAT TTATTGGATG TT
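
Because each row of the condensed listing is labeled with the coordinate of its first base, the labels can be checked against the running base count. The sketch below is illustrative only; the function name `check_origin` is an assumption and not part of the listing. Applied to three consecutive rows as printed above, it flags that the label jumps from 241 to 361 where 301 would be expected.

```python
import re


def check_origin(rows):
    """Return (label, expected) pairs for rows whose coordinate label does not
    equal the previous label plus the number of bases on the previous row."""
    problems = []
    expected = None
    for row in rows:
        match = re.match(r"\s*(\d+)\s+(.*)", row)
        if not match:
            continue  # skip blank or non-sequence lines
        label = int(match.group(1))
        bases = re.sub(r"[^ACGTacgt]", "", match.group(2))
        if expected is not None and label != expected:
            problems.append((label, expected))
        expected = label + len(bases)
    return problems


rows = [
    "181 GTTGCATATT TAAAACATGT TGAGCTACAG CACCAGATTC AGCAATTAAG CTCTAAGCCA",
    "241 TCCGCAAAAA TGACCTCTTA TCAAAAGGAG CAATTAAAGG TACTCTCTAA TCCTGACCTG",
    "361 TCTTTCGGGC TTCCTCTTAA TCTTTTTGAT GCAATCCGCT TTGCTTCTGA CTATAATAGT",
]
print(check_origin(rows))
```

The same check could be run over the whole ORIGIN block to confirm that the final row ends at base 9532, the length given on the LOCUS line.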
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Table 25: h3401-h2 captured Via CJ with BsmAI 
! 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 
! S A Q D I Q M T Q S P A T L S
aGT GCA Caa gac atc cag atg acc cag tct cca gcc acc ctg tct
! ApaLI... a gcc acc ! L25,L6,L20,L2,L16,A11
! Extender Bridge...

! 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
! V S P G E R A T L S C R A S Q
gtg tct cca ggg gaa agg gcc acc ctc tcc tgc agg gcc agt cag

! 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45
! S V S N N L A W Y Q Q K P G Q
agt gtt agt aac aac tta gcc tgg tac cag cag aaa cct ggc cag

! 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
! V P R L L I Y G A S T R A T D
gtt ccc agg ctc ctc atc tat ggt gca tcc acc agg gcc act gat

! 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75
! I P A R F S G S G S G T D F T
atc cca gcc agg ttc agt ggc agt ggg tct ggg aca gac ttc act

! 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90
! L T I S R L E P E D F A V Y Y
ctc acc atc agc aga ctg gag cct gaa gat ttt gca gtg tat tac

! 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105
! C Q R Y G S S P G W T F G Q G
tgt cag cgg tat ggt agc tca ccg ggg tgg acg ttc ggc caa ggg

! 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120
! T K V E I K R T V A A P S V F
acc aag gtg gaa atc aaa cga act gtg gct gca cca tct gtc ttc

! 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135
! I F P P S D E Q L K S G T A S
atc ttc ccg cca tct gat gag cag ttg aaa tct gga act gcc tct

! 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150
! V V C L L N N F Y P R E A K V
gtt gtg tgc ctg ctg aat aac ttc tat ccc aga gag gcc aaa gta

! 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165
! Q W K V D N A L Q S G N S Q E
cag tgg aag gtg gat aac gcc ctc caa tcg ggt aac tcc cag gag

! 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180
! S V T E Q D S K D S T Y S L S
agt gtc aca gag cag gac agc aag gac agc acc tac agc ctc agc



! 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 
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ISTLTLSKADYEKHKV 
agc acc ctg acg ctg agc aaa gca gac tac gag aaa cac aaa gtc

! 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210
! Y A C E V T H Q G L S S P V T
tac gcc tgc gaa gtc acc cat cag ggc ctg agc tcg cct gtc aca

! 211 212 213 214 215 216 217 218 219 220 221 222 223
! K S F N K G E C K G E F A
aag agc ttc aac aaa gga gag tgt aag ggc gaa ttc gc
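
The amino-acid line above each codon row can be checked by translating the codons with the standard genetic code. The sketch below is illustrative only: the names `CODON_TABLE` and `translate` are assumptions, and the example uses the first row of the table with its codons read in the plain a/c/g/t alphabet (for example `atc` and `gcc`), which is itself an assumed reading of the printed text.

```python
from itertools import product

# Standard genetic code, built in the conventional t, c, a, g ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(codon_row):
    """Translate a row of DNA into one-letter amino acids.

    Spaces and letter case are ignored; a trailing partial codon is dropped.
    """
    seq = codon_row.replace(" ", "").lower()
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


row_1_15 = "aGT GCA Caa gac atc cag atg acc cag tct cca gcc acc ctg tct"
print(translate(row_1_15))
```

For this row the translation agrees with the printed residues 1 to 15 (S A Q D I Q M T Q S P A T L S).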
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Table 26: h3401-d8 KAPPA captured with CJ and BsmAl 

! 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 
! S A Q D I Q M T Q S P A T L S
aGT GCA Caa gac atc cag atg acc cag tct cct gcc acc ctg tct

! ApaLI... Extender a gcc acc ! L25,L6,L20,L2,L16,A11

! A GCC ACC CTG TCT ! L2

! 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
! V S P G E R A T L S C R A S Q
gtg tct cca ggt gaa aga gcc acc ctc tcc tgc agg gcc agt cag
! GTG TCT CCA GGG GAA AGA GCC ACC CTC TCC TGC ! L2

! 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45
! N L L S N L A W Y Q Q K P G Q
aat ctt ctc agc aac tta gcc tgg tac cag cag aaa cct ggc cag

! 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
! A P R L L I Y G A S T G A I G
gct ccc agg ctc ctc atc tat ggt gct tcc acc ggg gcc att ggt

! 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75
! I P A R F S G S G S G T E F T
atc cca gcc agg ttc agt ggc agt ggg tct ggg aca gag ttc act

! 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90
! L T I S S L Q S E D F A V Y F
ctc acc atc agc agc ctg cag tct gaa gat ttt gca gtg tat ttc

! 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105
! C Q Q Y G T S P P T F G G G T
tgt cag cag tat ggt acc tca ccg ccc act ttc ggc gga ggg acc

! 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120
! K V E I K R T V A A P S V F I
aag gtg gag atc aaa cga act gtg gct gca cca tct gtc ttc atc

! 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135
! F P P S D E Q L K S G T A S V
ttc ccg cca tct gat gag cag ttg aaa tct gga act gcc tct gtt

! 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150
! V C P L N N F Y P R E A K V Q
gtg tgc ccg ctg aat aac ttc tat ccc aga gag gcc aaa gta cag

! 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165
! W K V D N A L Q S G N S Q E S
tgg aag gtg gat aac gcc ctc caa tcg ggt aac tcc cag gag agt

! 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180
! V T E Q D N K D S T Y S L S S
gtc aca gag cag gac aac aag gac agc acc tac agc ctc agc agc
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! 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 
! T L T L S K V D Y E K H E V Y
acc ctg acg ctg agc aaa gta gac tac gag aaa cac gaa gtc tac

! 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210
! A C E V T H Q G L S S P V T K
gcc tgc gaa gtc acc cat cag ggc ctt agc tcg ccc gtc acg aag

! 211 212 213 214 215 216 217 218 219 220 221 222 223
! S F N R G E C K K E F V
agc ttc aac agg gga gag tgt aag aaa gaa ttc gtt t
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Table 27. V3-23 VH framework with variegated codons shown 



17 18 19 20 21 22
A Q P A M A
5'-ctg tct gaa cG GCC cag ccG GCC atg gcc 29
3'-gac aga ctt gc cgg gtc ggc cgg tac cgg
Scab SfiI
NgoMI..
NcoI...

FR1(DP47/V3-23)
23 24 25 26 27 28 29 30
E V Q L L E S G
gaa|gtt|CAA|TTG|tta|gag|tct|ggt| 53
ctt|caa|gtt|aac|aat|ctc|aga|cca|
| MfeI |

-FR1-
31 32 33 34 35 36 37 38 39 40 41 42 43 44 45
G G L V Q P G G S L R L S C A
|ggc|ggt|ctt|gtt|cag|cct|ggt|ggt|tct|tta|cgt|ctt|tct|tgc|gct| 98
|ccg|cca|gaa|caa|gtc|gga|cca|cca|aga|aat|gca|gaa|aga|acg|cga|

Sites to be varied---> *** *** ***
FR1--->|...CDR1...|---FR2
46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
A S G F T F S S Y A M S W V R
|gct|TCC|GGA|ttc|act|ttc|tct|tCG|TAC|Gct|atg|tct|tgg|gtt|cgC| 143
|cga|agg|cct|aag|tga|aag|aga|agc|atg|cga|tac|aga|acc|caa|gcg|
| BspEI | | BsiWI| |BstXI.

Sites to be varied---> *** *** ***
FR2--->|...CDR2
61 62 63 64 65 66 67 68 69 70 71 72 73 74 75
Q A P G K G L E W V S A I S G
|CAa|gct|ccT|GGt|aaa|ggt|ttg|gag|tgg|gtt|tct|gct|atc|tct|ggt| 188
|gtt|cga|gga|cca|ttt|cca|aac|ctc|acc|caa|aga|cga|tag|aga|cca|
..BstXI |

CDR2...|---FR3---
76 77 78 79 80 81 82 83 84 85 86 87 88 89 90
S G G S T Y Y A D S V K G R F
|tct|ggt|ggc|agt|act|tac|tat|gct|gac|tcc|gtt|aaa|ggt|cgc|ttc| 233
|aga|cca|ccg|tca|tga|atg|ata|cga|ctg|agg|caa|ttt|cca|gcg|aag|

-FR3-
91 92 93 94 95 96 97 98 99 100 101 102 103 104 105
T I S R D N S K N T L Y L Q M
|act|atc|TCT|AGA|gac|aac|tct|aag|aat|act|ctc|tac|ttg|cag|atg| 278
|tga|tag|aga|tct|ctg|ttg|aga|ttc|tta|tga|gag|atg|aac|gtc|tac|
| XbaI |

---FR3--->|
106 107 108 109 110 111 112 113 114 115 116 117 118 119 120
N S L R A E D T A V Y Y C A K
|aac|agC|TTA|AGg|gct|gag|gac|aCT|GCA|Gtc|tac|tat|tgc|gct|aaa| 323
|ttg|tcg|aat|tcc|cga|ctc|ctg|tga|cgt|cag|atg|ata|acg|cga|ttt|
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10 



|AflII | | PstI |

CDR3..                -FR4-
121 122 123 124 125 126 127 128 129 130 131 132 133 134 135
D Y E G T G Y A F D I W G Q G
|gac|tat|gaa|ggt|act|ggt|tat|gct|ttc|gaC|ATA|TGg|ggt|caa|ggt| 368
|ctg|ata|ctt|cca|tga|cca|ata|cga|aag|ctg|tat|acc|cca|gtt|cca|
| NdeI |

FR4 ->|
136 137 138 139 140 141 142
T M V T V S S
|act|atG|GTC|ACC|gtc|tct|agt- 389
|tga|tac|cag|tgg|cag|aga|tca-
| BstEII |

143 144 145 146 147 148 149 150 151 152
A S T K G P S V F P
gcc tcc acc aaG GGC CCa tcg GTC TTC ccc-3' 419
cgg agg tgg ttc ccg ggt agc cag aag ggg-5'
Bsp120I. BbsI...(2/2)
ApaI...



(SFPRMET) 5'-ctg tct gaa cG GCC cag ccG-3'
(TOPFR1A) 5'-ctg tct gaa cG GCC cag ccG GCC atg gcc-
gaa|gtt|CAA|TTG|tta|gag|tct|ggt|-
|ggc|ggt|ctt|gtt|cag|cct|ggt|ggt|tct|tta-3'
(BOTFR1B) 3'-caa|gtc|gga|cca|cca|aga|aat|gca|gaa|aga|acg|cga|-
|cga|agg|cct|aag|tga|aag-5' ! bottom strand
(BOTFR2) 3'-acc|caa|gcg|-
|gtt|cga|gga|cca|ttt|cca|aac|ctc|acc|caa|aga|-5' ! bottom strand
(BOTFR3) 3'- a|cga|ctg|agg|caa|ttt|cca|gcg|aag|-
|tga|tag|aga|tct|ctg|ttg|aga|ttc|tta|tga|gag|atg|aac|gtc|tac|-
|ttg|tcg|aat|tcc|cga|ctc|ctg|tga-5'
(F06) 5'-gC|TTA|AGg|gct|gag|gac|aCT|GCA|Gtc|tac|tat|tgc|gct|aaa|-
|gac|tat|gaa|ggt|act|ggt|tat|gct|ttc|gaC|ATA|TGg|ggt|c-3'
(BOTFR4) 3'-cga|aag|ctg|tat|acc|cca|gtt|cca|-
|tga|tac|cag|tgg|cag|aga|tca-
cgg agg tgg ttc ccg ggt agc cag aag ggg-5' ! bottom strand
(BOTPRCPRIM) 3'-gg ttc ccg ggt agc cag aag ggg-5'

CDR1 diversity

(ON-vgC1) 5'-|gct|TCC|GGA|ttc|act|ttc|tct|<1>|TAC|<1>|atg|<1>|-
! CDR1 6859
|tgg|gtt|cgC|CAa|gct|ccT|GG-3'

<1> stands for an equimolar mix of {ADEFGHIKLMNPQRSTVWY}; no C
(this is not a sequence)
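
The figure 6859 given for CDR1 is the product of the mixture sizes at the varied positions: three <1> positions with 19 amino acids each. A minimal sketch of that arithmetic follows; the names `MIX_1` and `library_size` are illustrative assumptions, not terms from the table.

```python
from math import prod

# <1>: equimolar mix of all amino acids except cysteine (19 residues).
MIX_1 = set("ADEFGHIKLMNPQRSTVWY")


def library_size(mixes):
    """Number of distinct amino-acid sequences encoded by the varied positions."""
    return prod(len(mix) for mix in mixes)


# ON-vgC1 carries three <1> positions in CDR1.
cdr1_positions = [MIX_1, MIX_1, MIX_1]
print(library_size(cdr1_positions))  # 19 * 19 * 19 = 6859
```

The same function applies to any other variegated oligonucleotide once the mixture at each varied position is listed.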



! CDR2 diversity

(ON-vgC2) 5'-ggt|ttg|gag|tgg|gtt|tct|<2>|atc|<2>|<3>|-
! CDR2
|tct|ggt|ggc|<1>|act|<1>|tat|gct|gac|tcc|gtt|aaa|gg-3'
! CDR2
! <1> is an equimolar mixture of {ADEFGHIKLMNPQRSTVWY}; no C
! <2> is an equimolar mixture of {YRWVGS}; no ACDEFHIKLMNPQT
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! <3> is an equimolar mixture of {PS}; no ACDEFGHIKLMNQRTVWY 
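
The figure 6859 quoted for CDR1 is the number of amino-acid combinations at the three <1> positions (19 x 19 x 19). The Python sketch below reproduces that count and applies the same arithmetic to the variegated positions of ON-vgC2; the CDR2 number is computed here from the mixtures listed above and is not a value stated in the source.

```python
from math import prod

# Amino-acid mixtures as defined in the notes above
MIX = {
    "<1>": "ADEFGHIKLMNPQRSTVWY",  # 19 residues, no C
    "<2>": "YRWVGS",
    "<3>": "PS",
}

# Variegated positions, in 5'->3' order, read off the two oligonucleotides
CDR1_POSITIONS = ["<1>", "<1>", "<1>"]
CDR2_POSITIONS = ["<2>", "<2>", "<3>", "<1>", "<1>"]


def diversity(positions):
    """Number of distinct amino-acid sequences encoded by the variegated positions."""
    return prod(len(MIX[p]) for p in positions)


print("CDR1:", diversity(CDR1_POSITIONS))
print("CDR2:", diversity(CDR2_POSITIONS))
```

This counts theoretical protein-level diversity only; it makes no assumption about how many transformants are actually obtained.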
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Table 30: Oligonucleotides used to clone CDR1/2 diversity 
All sequences are 5' to 3'. 
1) ON_CD1Bsp, 30 bases

AccTcAcTggcTTccggATTcAcTTTcTcT

2) ON_Br12, 42 bases

AgAAAcccAcTccAAAccTTTAccAggAgcTTggcgAAcccA

3) ON_CD2Xba, 51 bases

ggAAggcAgTgATcTAgAgATAgTgAAgcgAccTTTAAcggAgTcAgcATA

4) ON_BotXba, 23 bases

ggAAggcAgTgATcTAgAgATAg
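
The declared lengths in Table 30 can be checked against the sequences themselves. The following Python sketch is only an illustration: the oligonucleotides are keyed by their table numbers (1 to 4), and the dictionary names are hypothetical.

```python
# Oligonucleotides of Table 30, keyed by their number in the table (all 5' to 3')
OLIGOS = {
    1: "AccTcAcTggcTTccggATTcAcTTTcTcT",
    2: "AgAAAcccAcTccAAAccTTTAccAggAgcTTggcgAAcccA",
    3: "ggAAggcAgTgATcTAgAgATAgTgAAgcgAccTTTAAcggAgTcAgcATA",
    4: "ggAAggcAgTgATcTAgAgATAg",
}
DECLARED_LENGTH = {1: 30, 2: 42, 3: 51, 4: 23}

for number, seq in OLIGOS.items():
    status = "ok" if len(seq) == DECLARED_LENGTH[number] else "MISMATCH"
    print(number, len(seq), status)

# Oligonucleotide 4 is the 5' end of oligonucleotide 3
print(OLIGOS[3].startswith(OLIGOS[4]))
```

The same check also shows that oligonucleotide 1 carries a TCCGGA (BspEI) site and oligonucleotides 3 and 4 carry a TCTAGA (XbaI) site, consistent with their names.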
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Table 31: Bridge/Extender Oligonucleotides 



ON_Lam1aB7 (rc)        GTGCTGACTCAGCCACCCTC.                     20
ON_Lam2aB7 (rc)        GCCCTGACTCAGCCTGCCTC.                     20
ON_Lam3lB7 (rc)        GAGCTGACTCAGG.ACCCTGC                     20
ON_Lam3rB7 (rc)        GAGCTGACTCAGCCACCCTC.                     20
ON_LamHf1cBrg (rc)     CCTCGACAGCGAAGTGCACAGAGCGTCTTGACTCAGCC    38
ON_LamHf1cExt          CCTCGACAGCGAAGTGCACAGAGCGTCTTG            30
ON_LamHf2b2Brg (rc)    CCTCGACAGCGAAGTGCACAGAGCGCTTTGACTCAGCC    38
ON_LamHf2b2Ext         CCTCGACAGCGAAGTGCACAGAGCGCTTTG            30
ON_LamHf2dBrg (rc)     CCTCGACAGCTAAGTGCACAGAGCGCTTTGACTCAGCC    38
ON_LamHf2dExt          CCTCGACAGCGAAGTGCACAGAGCGCTTTG            30
ON_LamHf3lBrg (rc)     CCTCGACAGCGAAGTGCACAGAGCGAATTGACTCAGCC    38
ON_LamHf3lExt          CCTCGACAGCGAAGTGCACAGAGCGAATTG            30
ON_LamHf3rBrg (rc)     CCTCGACAGCGAAGTGCACAGTACGAATTGACTCAGCC    38
ON_LamHf3rExt          CCTCGACAGCGAAGTGCACAGTACGAATTG            30
ON_lamPlePCR           CCTCGACAGCGAAGTGCACAG                     21
Consensus
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Table 32: Oligonucleotides used to make SSDNA locally 
double- stranded 

Adapters (8)

H43HF3.1-02#1        5'-cc gtg tat tac tgt gcg aga g-3'
H43.77.97.1-03#2     5'-ct gtg tat tac tgt gcg aga g-3'
H43.77.97.3-23#22    5'-cc gta tat tac tgt gcg aaa g-3'
H43.77.97.3-30#23    5'-ct gtg tat tac tgt gcg aaa g-3'
H43.77.97.4-39#44    5'-ct gtg tat tac tgt gcg aga c-3'
H43.77.97.5-51#48    5'-cc atg tat tac tgt gcg aga c-3'
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Table 33: Bridge/extender pairs 

Bridges (2)
H43.XABr1

5'-ggtgtagtgaTCTAGtgacaactctaagaatactctctacttgcagatgaacagC
TTtAGggctgaggacaCTGCAGtctactattgtgcgaga-3'

H43.XABr2

5'-ggtgtagtgaTCTAGtgacaactctaagaatactctctacttgcagatgaacagC
TTtAGggctgaggacaCTGCAGtctactattgtgcgaaa-3'

Extender
H43.XAExt

5'-ATAgTAgAcTgcAgTgTccTcAgcccTTAAgcTgTTcATcTgcAAgTAgAgAgTA
TTcTTAgAgTTgTcTcTAgATcAcTAcAcc-3'
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Table 34: PCR primers 
Primers 

H43.XAPCR2    gactgggTgTAgTgATcTAg
Hucmnest      cttttctttgttgccgttggggtg
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Table 35: PCR program for amplification of 
heavy chain CDR3 DNA 

95 degrees C    5 minutes

95 degrees C    20 seconds
60 degrees C    30 seconds    repeat 20x
72 degrees C    1 minute

72 degrees C    7 minutes
4 degrees C     hold

Reagents (100 ul reaction):

Template             5 ul ligation mix
10x PCR buffer       1x
Taq                  5 U
dNTPs                200 uM each
MgCl2                2 mM
H43.XAPCR2-biotin    400 nM
Hucmnest             200 nM
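
For bench use, the final concentrations in Table 35 can be converted into absolute amounts per 100 ul reaction, and the programmed hold times can be summed. The Python sketch below does only this arithmetic; the function and variable names are hypothetical, and ramp times between steps are ignored.

```python
def amount_pmol(conc_nM: float, volume_ul: float) -> float:
    """nM x ul gives fmol; divide by 1000 to express the amount in pmol."""
    return conc_nM * volume_ul / 1000.0


REACTION_UL = 100

# Primer amounts per reaction
primers_nM = {"H43.XAPCR2-biotin": 400, "Hucmnest": 200}
primer_pmol = {name: amount_pmol(c, REACTION_UL) for name, c in primers_nM.items()}

# dNTPs: 200 uM each = 200,000 nM; convert pmol to nmol
dntp_nmol_each = amount_pmol(200_000, REACTION_UL) / 1000.0

# Programmed time: initial denaturation + 20 cycles + final extension (seconds)
cycle_s = 20 + 30 + 60
program_s = 5 * 60 + 20 * cycle_s + 7 * 60

print(primer_pmol, dntp_nmol_each, round(program_s / 60, 1))
```

The two primers are used at a 2:1 molar ratio, with the biotinylated primer in excess.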

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Table 36: Annotated sequence of CJR DY3F7 (CJR-A05) 10251 bases 



Non-cutters

BclI     Tgatca      BsiWI    Cgtacg      BssSI    Cacgag
BstZ17I  GTAtac      BtrI     CACgtg      EcoRV    GATatc
FseI     GGCCGGcc    HpaI     GTTaac      MluI     Acgcgt
PmeI     GTTTaaac    PmlI     CACgtg      PpuMI    RGgwccy
RsrII    CGgwccg     SapI     GCTCTTC     SexAI    Accwggt
SgfI     GCGATcgc    SgrAI    CRccggyg    SphI     GCATGc
StuI     AGGcct      XmaI     Cccggg
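
Whether an enzyme is a non-cutter can be checked by searching the vector sequence for its recognition site, expanding the degenerate letters (R, Y, W, N and so on) into base classes. The following is a minimal Python sketch under stated assumptions: `find_sites` is a hypothetical helper, it reports the 1-based position of the first base of each match on the top strand, it searches both orientations of the site, and it treats the sequence as linear (matches spanning the origin of a circular vector are not found).

```python
import re

# IUPAC nucleotide codes expanded to regular-expression character classes
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "R": "[AG]", "Y": "[CT]", "W": "[AT]", "S": "[CG]",
         "M": "[AC]", "K": "[GT]", "B": "[CGT]", "D": "[AGT]",
         "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]"}
COMPLEMENT = str.maketrans("ACGTRYWSMKBDHVN", "TGCAYRWSKMVHDBN")


def find_sites(sequence: str, site: str) -> list[int]:
    """Return sorted 1-based start positions of a recognition site on either strand."""
    seq = re.sub(r"[^ACGT]", "", sequence.upper())  # drop spaces, digits, labels
    forward = site.upper()
    reverse = forward.translate(COMPLEMENT)[::-1]
    hits = set()
    for motif in {forward, reverse}:
        # Lookahead so that overlapping occurrences are all reported
        pattern = re.compile("(?=" + "".join(IUPAC[b] for b in motif) + ")")
        hits.update(m.start() + 1 for m in pattern.finditer(seq))
    return sorted(hits)


print(find_sites("tag tga tct CCT AGG", "Cctagg"))    # AvrII site present
print(find_sites("tag tga tct CCT AGG", "GGCCGGcc"))  # FseI site absent
```

An enzyme belongs in the non-cutter list when the search over the full vector sequence returns an empty list.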



Enzymes that cut from 1 to 4 times and other features 



End of genes II and X                    829
Start gene V                             843
BsrGI    Tgtaca                     1   1021
BspMI    Nnnnnnnnngcaggt            3   1104 5997
-"-      ACCTGCNNNNn                1   2281
End of gene V                           1106
Start gene VII                          1108
BsaBI    GATNNnnatc                 2   1149 3967
Start gene IX                           1208
End gene VII                            1211
SnaBI    TACgta                     2   1268 7133
BspHI    Tcatga                     3   1299 6085
Start gene VIII                         1301
End gene IX                             1304
End gene VIII                           1522
Start gene III                          1578
EagI     Cggccg                     2   1630 8905
XbaI     Tctaga                     2   1643 8436
KasI     Ggcgcc                     4   1650 8724 9039 9120
BsmI     GAATGCN                    2   1769 9065
BseRI    GAGGAGNNNNNNNNNN           2   2031 8516
-"-      NNnnnnnnnnctcctc           2   7603 8623
AlwNI    CAGNNNctg                  3   2210 8072 8182
BspDI    ATcgat                     2   2520 9883
NdeI     CAtatg                     3   2716 3796 9847
End gene III                            2846
Start gene VI                           2848
AfeI     AGCgct                     1   3032
End gene VI                             3187
Start gene I                            3189
EarI     CTCTTCNnnn                 2   4067 9274
-"-      Nnnnngaagag                2   6126 8953
PacI     TTAATtaa                   1   4125
Start gene IV                           4213
End gene I                              4235
BsmFI    Nnnnnnnnnnnnnnngtccc       2   5068 9515
MscI     TGGcca                     3   5073 7597 9160
PsiI     TTAtaa                     2   5349 5837
End gene IV                             5493
Start ori                               5494
NgoMIV   Gccggc                     3   5606 8213
BanII    GRGCYc                     4   5636 8080
DraIII   CACNNNgtg                  1   5709
DrdI     GACNNNNnngtc               1   5752
AvaI     Cycgrg                     2   5818 7240
PvuII    CAGctg                     1   5953



BsmBI     CGTCTCNnnnn        3   5964 8585 9271
End ori region                   5993
BamHI     Ggatcc             1   5994
HindIII   Aagctt             3   6000 7147 7384
BciVI     GTATCCNNNNNN       1   6077
Start bla                        6138
Eco57I    CTGAAG             2   6238 7716
SpeI      Actagt             1   6257
BcgI      gcannnnnntcg       1   6398
ScaI      AGTact             1   6442
PvuI      CGATcg             1   6553
FspI      TGCgca             1   6700
BglI      GCCNNNNnggc        3   6801 8208 8976
BsaI      GGTCTCNnnnn        1   6853
AhdI      GACNNNnngtc        1   6920
Eam1105I  GACNNNnngtc        1   6920
End bla                          6998
AccI      GTmkac             2   7153 8048
HincII    GTYrac             1   7153
SalI      Gtcgac             1   7153
XhoI      Ctcgag             1   7240
! Start PlacZ region             7246
! End PlacZ region               7381
! PflMI   CCANNNNntgg        1   7382
! RBS1                           7405
! start M13-iii signal seq for LC   7418
! ApaLI   Gtgcac             1   7470
! end M13-iii signal seq         7471
! Start light chain kappa L20:JK1   7472
! PflFI   GACNnngtc          3   7489 8705 9099
! SbfI    CCTGCAgg           1   7542
! PstI    CTGCAg             1   7543
! KpnI    GGTACc             1   7581
! XcmI    CCANNNNNnnnntgg    2   7585 9215
! NsiI    ATGCAt             2   7626 9503
! BsgI    ctgcac             1   7809
! BbsI    gtcttc             2   7820 8616
! BlpI    GCtnagc            1   8017
! EspI    GCtnagc            1   8017
! EcoO109I RGgnccy           2   8073 8605
! Ecl136I GAGctc             1   8080
! SacI    GAGCTc             1   8080
! End light chain                8122
! AscI    GGcgcgcc           1   8126
! BssHII  Gcgcgc             1   8127
! RBS2                           8147
! SfiI    GGCCNNNNnggcc      1   8207
! NcoI    Ccatgg             1   8218
! Start 3-23, FR1                8226
! MfeI    Caattg             1   8232
! BspEI   Tccgga             1   8298
! Start CDR1                     8316
! Start FR2                      8331
! BstXI   CCANNNNNntgg       2   8339 8812
! EcoNI   CCTNNnnnagg        2   8346 8675
! Start FR3                      8373
! XbaI    Tctaga             2   8436 1643
! AflII   Cttaag             1   8480
! Start CDR3                     8520
! AatII   GACGTc             1   8556
! Start FR4                      8562
! PshAI   GACNNnngtc         2   8573 9231



! BstEII   Ggtnacc            1
! Start CH1
! ApaI     GGGCCc             1
! Bsp120I  Gggccc             1   8606
! PspOMI   Gggccc             1
! AgeI     Accggt
! Bsu36I   CCtnagg
! End of CH1
! NotI     GCggccgc           1
! Start His6 tag
! Start cMyc tag
! Amber codon
! NheI     Gctagc             1
! Start M13 III Domain 3
! NruI     TCGcga             1
! BstBI    TTcgaa             1
! EcoRI    Gaattc             1
! XcmI     CCANNNNNnnnntgg    1
! BstAPI   GCANNNNntgc        1
! SacII    CCGCgg             1
! End IIIstump anchor             9455
! AvrII    Cctagg             1   9462
! trp terminator                  9470
! SwaI     ATTTaaat           1   9784
! Start gene II                   9850
! BglII    Agatct             1   9936



1 aat gct act act att agt aga att gat gcc acc ttt tca gct cgc gcc
! gene ii continued
49 cca aat gaa aat ata gct aaa cag gtt att gac cat ttg cga aat gta
97 tct aat ggt caa act aaa tct act cgt tcg cag aat tgg gaa tca act
145 gtt aTa tgg aat gaa act tcc aga cac cgt act tta gtt gca tat tta
193 aaa cat gtt gag cta cag caT TaT att cag caa tta agc tct aag cca
241 tcc gca aaa atg acc tct tat caa aag gag caa tta aag gta ctc tct
289 aat cct gac ctg ttg gag ttt gct tcc ggt ctg gtt cgc ttt gaa gct
337 cga att aaa acg cga tat ttg aag tct ttc ggg ctt cct ctt aat ctt
385 ttt gat gca atc cgc ttt gct tct gac tat aat agt cag ggt aaa gac
433 ctg att ttt gat tta tgg tca ttc tcg ttt tct gaa ctg ttt aaa gca
481 ttt gag ggg gat tca ATG aat att tat gac gat tcc gca gta ttg gac
! Start gene x, ii continues
529 gct atc cag tct aaa cat ttt act att acc ccc tct ggc aaa act tct

577 ttt gca aaa gcc tct cgc tat ttt ggt ttt tat cgt cgt ctg gta aac
625 gag ggt tat gat agt gtt gct ctt act atg cct cgt aat tcc ttt
673 cgt tat gta tct gca tta gtt gaa tgt ggt att cct aaa tct caa ctg
721 atg aat ctt tct acc tgt aat aat gtt gtt ccg tta gtt cgt ttt att
769 aac gta gat ttt tct tcc caa cgt cct gac tgg tat aat gag cca gtt
817 ctt aaa atc gca TAA
! End X & II
832 ggtaattca ca



! M1 E5 Q10 T15
843 ATG att aaa gtt gaa att aaa cca tct caa gcc caa ttt act act cgt
! Start gene V
! S17 S20 P25 E30
891 tct ggt gtt tct cgt cag ggc aag cct tat tca ctg aat gag cag ctt
! V35 E40 V45
939 tgt tac gtt gat ttg ggt aat gaa tat ccg gtt ctt gtc aag att act
! D50 A55 L60
987 ctt gat gaa ggt cag cca gcc tat gcg cct ggt cTG TAC Acc gtt cat
! BsrGI.
! L65 V70 S75 R80
1035 ctg tcc tct ttc aaa gtt ggt cag ttc ggt tcc ctt atg att gac cgt
! P85 K87 end of V
1083 ctg cgc ctc gtt ccg gct aag TAA C
!
1108 ATG gag cag gtc gcg gat ttc gac aca att tat cag gcg atg
! Start gene VII
!
1150 ata caa atc tcc gtt gta ctt tgt ttc gcg ctt ggt ata atc
! VII and IX overlap.
! S2 V3 L4 V5 S10
1192 gct ggg ggt caa agA TGA gt gtt tta gtg tat tct ttT gcc tct ttc gtt
! End VII
! start IX
! L13 W15 G20 T25 E29
1242 tta ggt tgg tgc ctt cgt agt ggc att acg tat ttt acc cgt tta atg gaa
!
1293 act tcc tc
!
! stop of IX, IX and VIII overlap by four bases
1301 ATG aaa aag tct tta gtc ctc aaa gcc tct gta gcc gtt gct acc ctc
! Start signal sequence of viii.
!
1349 gtt ccg atg ctg tct ttc gct gct gag ggt gac gat ccc gca aaa gcg
! mature VIII >
1397 gcc ttt aac tcc ctg caa gcc tca gcg acc gaa tat atc ggt tat gcg
1445 tgg gcg atg gtt gtt gtc att
1466 gtc ggc gca act atc ggt atc aag ctg ttt aag

! bases 1499-1539 are probable promoter for iii
1499 aaa ttc acc tcg aaa gca ! 1515
! -35 . .
!
1517 agc tga taaaccgat acaattaaag gctccttttg
! -10
!
1552 gagccttttt ttt GG AG At ttt ! S.D. uppercase, there may be 9 Ts
!
! <------ III signal sequence ------>
! M K K L L F A I P L V V P F
1574 caac GTG aaa aaa tta tta ttc gca att cct tta gtt gtt cct ttc ! 1620
!
! Y S G A A E S H L D G A
1620 tat tct ggc gCG GCC Gaa tca caT CTA GAc ggc gcc
! EagI.... XbaI....



! Domain 1
!    A   E   T   V   E   S   C   L   A
1656 gct gaa act gtt gaa agt tgt tta gca
!
!    K   S   H   T   E   N   S   F   T   N   V   W   K   D   D   K   T
1683 aaA Tcc cat aca gaa aat tca ttc aCT AAC GTC TGG AAA GAC GAC AAA ACt
!
!    L   D   R   Y   A   N   Y   E   G   C   L   W   N   A   T   G   V
1734 tta gat cgt tac gct aac tat gag ggC tgt ctg tgG AAT GCt aca ggc gtt
! BsmI..
!    V   V   C   T   G   D   E   T   Q   C   Y   G   T   W   V   P   I
1785 gta gtt tgt act ggt GAC GAA ACT CAG TGT TAC GGT ACA TGG GTT cct att
!
!    G   L   A   I   P   E   N
1836 ggg ctt gct atc cct gaa aat
!    E   G   G   G   S   E   G   G   G   S
1857 gag ggt ggt ggc tct gag ggt ggc ggt tct
!
!    E   G   G   G   S   E   G   G   G   T
1887 gag ggt ggc ggt tct gag ggt ggc ggt act
!
! Domain 2
1917 aaa cct cct gag tac ggt gat aca cct att ccg ggc tat act tat atc aac
1968 cct ctc gac ggc act tat ccg cct ggt act gag caa aac ccc gct aat cct

2019 aat cct tct ctt GAG GAG tct cag cct ctt aat act ttc atg ttt cag aat
! BseRI..
2070 aat agg ttc cga aat agg cag ggg gca tta act gtt tat acg ggc act
2118 gtt act caa ggc act gac ccc gtt aaa act tat tac cag tac act cct
2166 gta tca tca aaa gcc atg tat gac gct tac tgg aac ggt aaa ttC AGA
! AlwNI
2214 GAC TGc gct ttc cat tct ggc ttt aac gaG gat TTa ttT gtt tgt gaa
! AlwNI
2262 tat caa ggc caa tcg tct gac ctg cct caa cct cct gtc aat gct
!
2307 ggc ggc ggc tct
! start L2
2319 ggt ggt ggt tct
2331 ggt ggc ggc tct
2343 gag ggt ggt ggc tct gag gga ggc ggt tcc
2373 ggt ggt ggc tct ggt ! end L2

! Many published sequences of M13-derived phage have a longer linker
! than shown here by repeats of the EGGGS motif two more times.

! Domain 3
!    S   G   D   F   D   Y   E   K   M   A   N   A   N   K   G   A
2388 tcc ggt gat ttt gat tat gaa aag atg gca aac gct aat aag ggg gct
!
!    M   T   E   N   A   D   E   N   A   L   Q   S   D   A   K   G
2436 atg acc gaa aat gcc gat gaa aac gcg cta cag tct gac gct aaa ggc
!
!    K   L   D   S   V   A   T   D   Y   G   A   A   I   D   G   F
2484 aaa ctt gat tct gtc gct act gat tac ggt gct gct atc gat ggt ttc
!
!    I   G   D   V   S   G   L   A   N   G   N   G   A   T   G   D
2532 att ggt gac gtt tcc ggc ctt gct aat ggt aat ggt gct act ggt gat
!
!    F   A   G   S   N   S   Q   M   A   Q   V   G   D   G   D   N
2580 ttt gct ggc tct aat tcc caa atg gct caa gtc ggt gac ggt gat aat
!
!    S   P   L   M   N   N   F   R   Q   Y   L   P   S   L   P   Q
2628 tca cct tta atg aat aat ttc cgt caa tat tta cct tcc ctc cct caa
!
!    S   V   E   C   R   P   F   V   F   G   A   G   K   P   Y   E
2676 tcg gtt gaa tgt cgc cct ttt gtc ttt Ggc gct ggt aaa cca tat gaa
!
!    F   S   I   D   C   D   K   I   N   L   F   R
2724 ttt tct att gat tgt gac aaa ata aac tta ttc cgt
! End Domain 3
!
!    G   V   F   A   F   L   L   Y   V   A   T   F   M   Y   V   F140
2760 ggt gtc ttt gcg ttt ctt tta tat gtt gcc acc ttt atg tat gta ttt
! start transmembrane segment
!
!    S   T   F   A   N   I   L
2808 tct acg ttt gct aac ata ctg
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R N K E S 
2829 cgt aat aag gag tct TAA ! stop of iii 
Intracellular anchor. 



!       M1  P2  V   L   L5  G   I   P   L   L10 L   R   F   L   G15
2847 tc ATG cca gtt ctt ttg ggt att ccg tta tta ttg cgt ttc ctc ggt
! Start VI
!
2894 ttc ctt ctg gta act ttg ttc ggc tat ctg ctt act ttt ctt aaa aag
2942 ggc ttc ggt aag ata gct att gct att tca ttg ttt ctt gct ctt att
2990 att ggg ctt aac tca att ctt gtg ggt tat ctc tct gat att agc gct
3038 caa tta ccc tct gac ttt gtt cag ggt gtt cag tta att ctc ccg tct
3086 aat gcg ctt ccc tgt ttt tat gtt att ctc tct gta aag gct gct att
3134 ttc att ttt gac gtt aaa caa aaa atc gtt tct tat ttg gat tgg gat
!

! M1 A2 V3 F5 L10 G13
3182 aaa TAA t ATG gct gtt tat ttt gta act ggc aaa tta ggc tct gga
! end VI Start gene I
!
!    K   T   L   V   S   V   G   K   I   Q   D   K   I   V   A
3228 aag acg ctc gtt agc gtt ggt aag att cag gat aaa att gta gct
!
!    G   C   K   I   A   T   N   L   D   L   R   L   Q   N   L
3273 ggg tgc aaa ata gca act aat ctt gat tta agg ctt caa aac ctc
!
!    P   Q   V   G   R   F   A   K   T   P   R   V   L   R   I
3318 ccg caa gtc ggg agg ttc gct aaa acg cct cgc gtt ctt aga ata
!
!    P   D   K   P   S   I   S   D   L   L   A   I   G   R   G
3363 ccg gat aag cct tct ata tct gat ttg ctt gct att ggg cgc ggt
!
!    N   D   S   Y   D   E   N   K   N   G   L   L   V   L   D
3408 aat gat tcc tac gat gaa aat aaa aac ggc ttg ctt gtt ctc gat
!
!    E   C   G   T   W   F   N   T   R   S   W   N   D   K   E
3453 gag tgc ggt act tgg ttt aat acc cgt tct tgg aat gat aag gaa
!
!    R   Q   P   I   I   D   W   F   L   H   A   R   K   L   G
3498 aga cag ccg att att gat tgg ttt cta cat gct cgt aaa tta gga
!
!    W   D   I   I   F   L   V   Q   D   L   S   I   V   D   K
3543 tgg gat att att ttt ctt gtt cag gac tta tct att gtt gat aaa
!
!    Q   A   R   S   A   L   A   E   H   V   V   Y   C   R   R
3588 cag gcg cgt tct gca tta gct gaa cat gtt gtt tat tgt cgt cgt
!
!    L   D   R   I   T   L   P   F   V   G   T   L   Y   S   L
3633 ctg gac aga att act tta cct ttt gtc ggt act tta tat tct ctt
!
!    I   T   G   S   K   M   P   L   P   K   L   H   V   G   V
3678 att act ggc tcg aaa atg cct ctg cct aaa tta cat gtt ggc gtt

!
!    V   K   Y   G   D   S   Q   L   S   P   T   V   E   R   W
3723 gtt aaa tat ggc gat tct caa tta agc cct act gtt gag cgt tgg
!    L   Y   T   G   K   N   L   Y   N   A   Y   D   T   K   Q
3768 ctt tat act ggt aag aat ttg tat aac gca tat gat act aaa cag
!    A   F   S   S   N   Y   D   S   G   V   Y   S   Y   L   T
3813 gct ttt tct agt aat tat gat tcc ggt gtt tat tct tat tta acg
!    P   Y   L   S   H   G   R   Y   F   K   P   L   N   L   G
3858 cct tat tta tca cac ggt cgg tat ttc aaa cca tta aat tta ggt
!    Q   K   M   K   L   T   K   I   Y   L   K   K   F   S   R
3903 cag aag atg aaa tta act aaa ata tat ttg aaa aag ttt tct cgc
!    V   L   C   L   A   I   G   F   A   S   A   F   T   Y   S
3948 gtt ctt tgt ctt gcg att gga ttt gca tca gca ttt aca tat agt
!    Y   I   T   Q   P   K   P   E   V   K   K   V   V   S   Q
3993 tat ata acc caa cct aag ccg gag gtt aaa aag gta gtc tct cag
!    T   Y   D   F   D   K   F   T   I   D   S   S   Q   R   L
4038 acc tat gat ttt gat aaa ttc act att gac tct tct cag cgt ctt
!    N   L   S   Y   R   Y   V   F   K   D   S   K   G   K   L
4083 aat cta agc tat cgc tat gtt ttc aag gat tct aag gga aaa TTA
! PacI
!    I   N   S   D   D   L   Q   K   Q   G   Y   S   L   T   Y
4128 ATT AAt agc gac gat tta cag aag caa ggt tat tca ctc aca tat
! PacI



! i  I   D   L   C   T   V   S   I   K   K   G   N   S   N   E
! iv M1 K
4173 att gat tta tgt act gtt tcc att aaa aaa ggt aat tca aAT Gaa
! Start IV
! i I V K C N .End of I
! iv L3 L N5* V I7 N F V10
4218 att gtt aaa tgt aat TAA T TTT GTT
! IV continued
4243 ttc ttg atg ttt gtt tca tca tct tct ttt gct cag gta att gaa atg
4291 aat aat tcg cct ctg cgc gat ttt gta act tgg tat tca aag caa tca
4339 ggc gaa tcc gtt att gtt tct ccc gat gta aaa ggt act gtt act gta
4387 tat tca tct gac gtt aaa cct gaa aat cta cgc aat ttc ttt att tct
4435 gtt tta cgt gcA aat aat ttt gat atg gtA ggt tcT aAC cct tcc atT
4483 att cag aag tat aat cca aac aat cag gat tat att gat gaa ttg cca
4531 tca tct gat aat cag gaa tat gat gat aat tcc gct cct tct ggt ggt
4579 ttc ttt gtt ccg caa aat gat aat gtt act caa act ttt aaa att aat
4627 aac gtt cgg gca aag gat tta ata cga gtt gtc gaa ttg ttt gta aag
4675 tct aat act tct aaa tcc tca aat gta tta tct att gac ggc tct aat
4723 cta tta gtt gtt agt gcT cct aaa gat att tta gat aac ctt cct
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caa 



4771 ttc ctt tcA act gtt gat ttg cca act gac cag ata ttg att gag ggt
4819 ttg ata ttt gag gtt cag caa ggt gat gct tta gat ttt tca ttt gct
4867 gct ggc tct cag cgt ggc act gtt gca ggc ggt gtt aat act gac cgc
4915 ctc acc tct gtt tta tct tct gct ggt ggt tcg ttc ggt att ttt aat
4963 ggc gat gtt tta ggg cta tca gtt cgc gca tta aag act aat agc cat
5011 tca aaa ata ttg tct gtg cca cgt att ctt acg ctt tca ggt cag aag
5059 ggt tct atc tct gtT GGC CAg aat gtc cct ttt att act ggt cgt gtg
! MscI..
5107 act ggt gaa tct gcc aat gta aat aat cca ttt cag acg att gag cgt
5155 caa aat gta ggt att tcc atg agc gtt ttt cct gtt gca atg gct ggc
5203 ggt aat att gtt ctg gat att acc agc aag gcc gat agt ttg agt tct
5251 tct act cag gca agt gat gtt att act aat caa aga agt att gct aca
5299 acg gtt aat ttg cgt gat gga cag act ctt tta ctc ggt ggc ctc act
5347 gat tat aaa aac act tct caG gat tct ggc gta ccg ttc ctg tct aaa
5395 atc cct tta atc ggc ctc ctg ttt agc tcc cgc tct gat tcT aac gag
5443 gaa agc acg tta tac gtg ctc gtc aaa gca acc ata gta cgc gcc ctg
5491 TAG cggcgcatt
! End IV

5503 aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc
5563 gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcGCCGGCt ttccccgtca
! NgoMI.
5623 agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc
5683 caaaaaactt gatttgggtg atggttCACG TAGTGggcca tcgccctgat agacggtttt
! DraIII
5743 tcgccctttG ACGTTGGAGT Ccacgttctt taatagtgga ctcttgttcc aaactggaac
! DrdI
5803 aacactcaac cctatctcgg gctattcttt tgatttataa gggattttgc cgatttcgga
5863 accaccatca aacaggattt tcgcctgctg gggcaaacca gcgtggaccg cttgctgcaa
5923 ctctctcagg gccaggcggt gaagggcaat CAGCTGttgc cCGTCTCact ggtgaaaaga
! PvuII. BsmBI.
5983 aaaaccaccc tGGATCC AAGCTT
! BamHI HindIII (1/2)
! Insert carrying bla gene
6006 gcaggtg gcacttttcg gggaaatgtg cgcggaaccc
6043 ctatttgttt atttttctaa atacattcaa atatGTATCC gctcatgaga caataaccct
! BciVI
6103 gataaatgct tcaataatat tgaaaaAGGA AGAgt
! RBS.?
! Start bla gene



6138 ATG agt att caa cat ttc cgt gtc gcc ctt att ccc ttt ttt gcg gca ttt
6189 tgc ctt cct gtt ttt gct cac cca gaa acg ctg gtg aaa gta aaa gat gct
6240 gaa gat cag ttg ggC gcA CTA GTg ggt tac atc gaa ctg gat ctc aac agc
! SpeI..
! ApaLI & BssSI Removed
6291 ggt aag atc ctt gag agt ttt cgc ccc gaa gaa cgt ttt cca atg atg agc
6342 act ttt aaa gtt ctg cta tgt GGC GcG Gta tcc cgt att gac gcc ggg
6393 caa gaG CAA CTC GGT CGc cgC ATA cAC tat tct cag aat gac ttg gtt gAG
! ScaI
6444 TAC Tca cca gtc aca gaa aag cat ctt acg gat ggc atg aca gta aga gaa
! ScaI.
6495 tta tgc agt gct gcc ata acc atg agt gat aac aCt gcg gcc aac tta ctt
6546 ctg aca aCG ATC Gga gga ccg aag gag cta acc gct ttt ttg cac aac atg
! PvuI..
6597 ggg gat cat gta act cgc ctt gat cgt tgg gaa ccg gag ctg aat gaa gcc
6648 ata cca aac gac gag cgt gac acc acg atg cct gta gca atg Gca aca acg
6699 tTG CGC Aaa cta tta act ggc gaa cta ctt act cta gct tcc cgg caa caa
! FspI..
!
6750 tta ata gac tgg atg gag gcg gat aaa gtt gca gga cca ctt ctg cgc tcg
6801 GCC ctt ccG GCt ggc tgg ttt att gct gat aaa tct gga gcc ggt gag cgt
! BglI...
6852 gGG TCT Cgc ggt atc att gca gca ctg ggg cca gat ggt aag ccc tcc cgt
! BsaI..
6903 atc gta gtt atc tac acG ACg ggg aGT Cag gca act atg gat gaa cga aat
! AhdI



6954 aga cag atc gct gag ata ggt gcc tca ctg att aag cat tgg TAA ctgt
! stop
7003 cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa
7063 ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt
7123 cgttccactg tacgtaagac cccc
7147 AAGCTT GTCGAC tgaa tggcgaatgg cgctttgcct
! HindIII SalI..
! (2/2) HincII
7183 ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt

! Start of Fab-display cassette, the Fab DSR-A05, selected for
! binding to a protein antigen.

7233 CCTGAcG CTCGAG
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! xBsu36I Xhol. . 

! PlacZ promoter is in the following block
7246 cgcaacgc aattaatgtg agttagctca
7274 ctcattaggc accccaggct ttacacttta tgcttccggc tcgtatgttg
7324 tgtggaattg tgagcggata acaatttcac acaggaaaca gctatgacca
7374 tgattacgCC AagcttTGGa gccttttttt tggagatttt caac
! PflMI
! Hind3. (there are 3)
! Gene iii signal sequence:
!    1   2   3   4   5   6   7   8   9   10  11  12  13  14  15
!    M   K   K   L   L   F   A   I   P   L   V   V   P   F   Y
7418 gtg aaa aaa tta tta ttc gca att cct tta gtt gtt cct ttc tat
!
!    16  17  18  Start light chain (L20:JK1)
!    S   H   S   A   Q   D   I   Q   M   T   Q   S   P   A
7463 tct cac aGT GCA Caa gac atc cag atg acc cag tct cca gcc
! ApaLI...
! Sequence supplied by extender
!
!    T   L   S   L
7505 acc ctg tct ttg
!
!    S   P   G   E   R   A   T   L   S   C   R   A   S   Q   G
7517 tct cca ggg gaa aga gcc acc ctc tcc tgc agg gcc agt cag Ggt
!
!    V   S   S   Y   L   A   W   Y   Q   Q   K   P   G   Q   A
7562 gtt agc agc tac tta gcc tgg tac cag cag aaa cct ggc cag gct
!
!    P   R   L   L   I   Y   D   A   S   S   R   A   T   G   I
7607 ccc agg ctc ctc atc tat gAt gca tcc aAc agg gcc act ggc atc
!
!    P   A   R   F   S   G   S   G   P   G   T   D   F   T   L
7652 cca gCc agg ttc agt ggc agt ggg Cct ggg aca gac ttc act ctc
!
!    T   I   S   S   L   E   P   E   D   F   A   V   Y   Y   C
7697 acc atc agc agC ctA gag cct gaa gat ttt gca gtT tat tac tgt
!
!    Q   Q   R   S   W   H   P   W   T   F   G   Q   G   T   R
7742 cag cag CGt aAc tgg cat ccg tgg ACG TTC GGC CAA GGG ACC AAG
!
!    V   E   I   K   R   T   V   A   A   P   S   V   F   I   F
7787 gtg gaa atc aaa cga act gtg gCT GCA Cca tct gtc ttc atc ttc
! BsgI....
!
!    P   P   S   D   E   Q   L   K   S   G   T   A   S   V   V
7832 ccg cca tct gat gag cag ttg aaa tct gga act gcc tct gtt gtg
!
!    C   L   L   N   N   F   Y   P   R   E   A   K   V   Q   W
7877 tgc ctg ctg aat aac ttc tat ccc aga gag gcc aaa gta cag tgg
!
!    K   V   D   N   A   L   Q   S   G   N   S   Q   E   S   V
7922 aag gtg gat aac gcc ctc caa tcg ggt aac tcc cag gag agt gtc
!
!    T   E   R   D   S   K   D   S   T   Y   S   L   S   S   T
7967 aca gag cgg gac agc aag gac agc acc tac agc ctc agc agc acc
!
!    L   T   L   S   K   A   D   Y   E   K   H   K   V   Y   A
8012 ctg acG CTG AGC aaa gca gac tac gag aaa cac aaa gtc tac gcc
! EspI
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Pi 

!    C   E   V   T   H   Q   G   L   S   S   P   V   T   K   S
8057 tgc gaa gtc acc cat cag ggc ctG AGC TCg ccc gtc aca aag agc
! SacI....
!    F   N   R   G   E   C
8102 ttc aac agg gga gag tgt taa taa
8126 GGCGCG CCaattctat ttcaaGGAGA cagtcata
! AscI RBS2.
! PelB signal sequence (22 codons) >
!    1   2   3   4   5   6   7   8   9   10  11  12  13  14  15
!    M   K   Y   L   L   P   T   A   A   A   G   L   L   L   L
8160 atg aaa tac cta ttg cct acg gca gcc gct gga ttg tta tta ctc
! ...PelB signal --> Start VH, FR1 >
!    16  17  18  19  20  21  22  23  24  25  26  27  28  29  30
!    A   A   Q   P   A   M   A   E   V   Q   L   L   E   S   G
8205 gcG GCC cag ccG GCC atg gcc gaa gtt CAA TTG tta gag tct ggt
! SfiI. NcoI.. MfeI..
!    31  32  33  34  35  36  37  38  39  40  41  42  43  44  45
!    G   G   L   V   Q   P   G   G   S   L   R   L   S   C   A
8250 ggc ggt ctt gtt cag cct ggt ggt tct tta cgt ctt tct tgc gct
! ...FR1- -> CDR1- -> FR2-
!    46  47  48  49  50  51  52  53  54  55  56  57  58  59  60
!    A   S   G   F   T   F   S   T   Y   E   M   R   W   V   R
8295 gct TCC GGA ttc act ttc tct act tac gag atg cgt tgg gtt cgC
! BspEI.. BstXI.
! FR2- -> CDR2
!    61  62  63  64  65  66  67  68  69  70  71  72  73  74  75
!    Q   A   P   G   K   G   L   E   W   V   S   Y   I   A   P
8340 CAa gct ccT GGt aaa ggt ttg gag tgg gtt tct tat atc gct cct
! BstXI.
! ..CDR2 > FR3
!    76  77  78  79  80  81  82  83  84  85  86  87  88  89  90
!    S   G   G   D   T   A   Y   A   D   S   V   K   G   R   F
8385 tct ggt ggc gat act gct tat gct gac tcc gtt aaa ggt cgc ttc
!    91  92  93  94  95  96  97  98  99  100 101 102 103 104 105
!    T   I   S   R   D   N   S   K   N   T   L   Y   L   Q   M
8430 act atc TCT AGA gac aac tct aag aat act ctc tac ttg cag atg
! XbaI...
! Supplied by extender
! FR3 >
!    106 107 108 109 110 111 112 113 114 115 116 117 118 119 120
!    N   S   L   R   A   E   D   T   A   V   Y   Y   C   A   R
8475 aac agC TTA AGg gct gag gac act gca gtc tac tat tgt gcg agg
! AflII...
! from extender >



! CDR3 > FR4 -->
!    121 122 123 124 125 126 127 128 129 130 131 132 133 134 135
!    R   L   D   G   Y   I   S   Y   Y   Y   G   M   D   V   W
8520 agg ctc gat ggc tat att tcc tac tac tac ggt atg GAC GTC tgg
! AatII..
!    136 137 138 139 140 141 142 143 144 145
!    G   Q   G   T   T   V   T   V   S   S
8565 ggc caa ggg acc acG GTC ACC gtc tca agc
! BstEII...
! CH1 of IgG1 >
!    A   S   T   K   G   P   S   V   F   P   L   A   P   S   S
8595 gcc tcc acc aag ggc cca tcg gtc ttc ccc ctg gca ccc tcc tcc
!
!    K   S   T   S   G   G   T   A   A   L   G   C   L   V
8640 aag agc acc tct ggg ggc aca gcg gcc ctg ggc tgc ctg gtc aag
!
!    D   Y   F   P   E   P   V   T   V   S   W   N   S   G
8685 gac tac ttc ccc gaa ccg gtg acg gtg tcg tgg aac tca ggc gcc
!
!    L   T   S   G   V   H   T   F   P   A   V   L   Q   S
8730 ctg acc agc ggc gtc cac acc ttc ccg gct gtc cta cag tCC TCA
! Bsu36I.
!    G   L   Y   S   L   S   S   V   V   T   V   P   S   S
8775 GGa ctc tac tcc ctc agc agc gta gtg acc gtg ccc tcc agc agc
! Bsu36I.
!    L   G   T   Q   T   Y   I   C   N   V   N   H   K   P
8820 ttg ggc acc cag acc tac atc tgc aac gtg aat cac aag ccc agc
!
!    N   T   K   V   D   K   K   V   E   P   K   S   C   A
8865 aac acc aag gtg gac aag aaa gtt gag ccc aaa tct tgt GCG GCC
! NotI
!    A   H   H   H   H   H   H   G   A   A   E   Q   K   L
8910 GCa cat cat cat cac cat cac ggg gcc gca gaa caa aaa ctc atc
! ..NotI. ...H6 tag Myc-Tag
!    S   E   E   D   L   N   G   A   A   q   A   S   S   A
8955 tca gaa gag gat ctg aat ggg gcc gca tag GCT AGC tct gct
! Myc-Tag ... Amber NheI...
! III' stump
! Domain 3 of III
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! SGDFDYEKMANANKG 

8997 agt ggc gac ttc gac tac gag aaa atg get aat gec aac aaa GGC 
GCC 

! tec tttrtag acttgg 

5 t !W.T. 
i 

Kasl. . . (2/4) 

! 

! MTENADENALQS DAK 

10 9045 atG ACT GAG AAC GCT GAC GAG aat get ttg caa age gat gee aag 

ggt 

! catctacgcag tct eta 

c !W.T. 

i 

15! KLDSVAT DYGAAI DG 

9093 aag tta gac age gTC GCG Ace gac tat GGC GCC gee ATC GAc ggc 
ttt 

! a c t t tct tttcttt tt 

c !W.T. 

20 ! Nrul Kasl... (3/4) 

i 

!     IGDVSGLANGNGATGD
9141  atc ggc gat gtc agt ggt tTG GCC Aac ggc aac gga gcc acc gga gac
!     t t c t tcc ccttttttttt t  ! W.T.
!     MscI (3/3)

!     FAGSNSQMAQVGDGDN
9189  ttc GCA GGT tcG AAT TCt cag atg gcC CAG GTT GGA GAT GGg gac aac
!     ttct ca tactctt t  ! W.T.
!     BspMI.. (2/2)  XcmI
!     EcoRI...
!

!     SPLMNNFRQYLPSLPQ
9237  agt ccg ctt atg aac aac ttt aga cag tac ctt ccg tct ctt ccg cag
!     tca tta t t cct a tta t c c t a  ! W.T.
!

!     SVECRPFVFSAGKPYE
9285  agt gtc gag tgc cgt cca ttc gtt ttc tct gcc ggc aag cct tac gag
!     tcg tatcttct agc t t a a t a  ! W.T.
!

!     FSIDCDKINLFR
9333  ttc aGC Atc gac TGC gat aag atc aat ctt ttC CGC
!     t tct t t t c a a cta c t  ! W.T.
!     BstAPI  SacII...
!     End Domain 3
!

!     GVFAFLLYVATFMYVF
9369  GGc gtt ttc gct ttc ttg cta tac gtc gct act ttc atg tac gtt ttc
!     tctgtcttattcct ta t  ! W.T.
!     start transmembrane segment
!

!     STFANILRNKES
9417  aGC ACT TTC GCC AAT ATT TTA Cgc aac aaa gaa agc
!     tct gttcacg ttgg tct  ! W.T.
!     Intracellular anchor.
!

9453  tag tga tct CCT AGG
!     AvrII..
!
9468  aag ccc gcc taa tga gcg ggc ttt ttt ttt ct ggt
!     |  Trp terminator  |
!
!     End Fab cassette
!
9503  ATGCAT CCTGAGG ccgat actgtcgtcg tcccctcaaa ctggcagatg
!     NsiI.. Bsu36I. (3/3)

9551  cacggttacg atgcgcccat ctacaccaac gtgacctatc ccattacggt caatccgccg

9611  tttgttccca cggagaatcc gacgggttgt tactcgctca catttaatgt tgatgaaagc

9671  tggctacagg aaggccagac gcgaattatt tttgatggcg ttcctattgg ttaaaaaatg

9731  agctgattta acaaaaattt aaTgcgaatt ttaacaaaat attaacgttt acaATTTAAA
!     SwaI...
9791  Tatttgctta tacaatcttc ctgtttttgg ggcttttctg attatcaacc GGGGTAcat
9850  ATG att gac atg cta gtt tta cga tta ccg ttc atc gat tct ctt gtt tgc
!     Start gene II
9901  tcc aga ctc tca ggc aat gac ctg ata gcc ttt gtA GAT CTc tca aaa ata
!     BglII...
9952  gct acc ctc tcc ggc atT aat tta tca gct aga acg gtt gaa tat cat att
10003 gat ggt gat ttg act gtc tcc ggc ctt tct cac cct ttt gaa tct tta cct
10054 aca cat tac tca ggc att gca ttt aaa ata tat gag ggt tct aaa aat ttt
10105 tat cct tgc gtt gaa ata aag gct tct ccc gca aaa gta tta cag ggt cat
10156 aat gtt ttt ggt aca acc gat tta gct tta tgc tct gag gct tta ttg ctt
10207 aat ttt gct aat tct ttg cct tgc ctg tat gat tta ttg gat gtt !
!     gene II continues
!
!     End of Table
Table 37: DNA seq of w.t. M13 gene iii

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 
fMKKLLFAIPLVVPFY
1579 gtg aaa aaa tta tta ttc gca att cct tta gtt gtt cct ttc tat 
Signal sequence 

16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 
SHSAETVESCLAKPH 
1624 tct cac tcc gct gaa act gtt gaa agt tgt tta gca aaa ccc cat
Signal sequence> Domain 1 

31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 
TENSFTNVWKDDKTL 
1669 aca gaa aat tca ttt act aac gtc tgg aaa gac gac aaa act tta
Domain 1 

46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 
DRYANYEGCLWNATG 
1714 gat cgt tac gct aac tat gag ggt tgt ctg tgG AAT GCt aca ggc

BsmI....

Domain 1 

61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 
VVVCTGDETQCYGTW 
1759 gtt gta gtt tgt act ggt gac gaa act cag tgt tac ggt aca tgg 
Domain 1 

76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 
VPIGLAIPENEGGGS
1804 gtt cct att ggg ctt gct atc cct gaa aat gag ggt ggt ggc tct
Domain 1 > Linker 1 

91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 
EGGGSEGGGSEGGGT 
1849 gag ggt ggc ggt tct gag ggt ggc ggt tct gag ggt ggc ggt act
Linker 1 > 

106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 
KPPEYGDTPIPGYTY 
1894 aaa cct cct gag tac ggt gat aca cct att ccg ggc tat act tat 
Domain 2 

121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 
INPLDGTYPPGTEQN 
1939 atc aac cct ctc gac ggc act taT CCG CCt ggt act gag caa aac

EciI

Domain 2 

136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 
PANPNPSLEESQPLN 
1984 ccc gct aat cct aat cct tct ctt GAG GAG tct cag cct ctt aat

BseRI..

Domain 2 

151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 
TFMFQNNRFRNRQGA 
2029 act ttc atg ttt cag aat aat agg ttc cga aat agg cag ggg gca 
Domain 2 

166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 
LTVYTGTVTQGTDPV 
2074 tta act gtt tat acg ggc act gtt act caa ggc act gac ccc gtt
Domain 2 

181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 
KTYYQYTPVSSKAMY 
2119 aaa act tat tac cag tac act cct gta tca tca aaa gcc atg tat
Domain 2 

196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 
DAYWNGKFRDCAFHS 
2164 gac gct tac tgg aac ggt aaa ttC AGa gaC TGc gct ttc cat tct

AlwNI 

Domain 2 

211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 
GFNEDPFVCEYQGQS 
2209 ggc ttt aat gaG GAT CCa ttc gtt tgt gaa tat caa ggc caa tcg
BamHI... 

Domain 2 

226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 
SDLPQPPVNAGGGSG 
2254 tct gac ctg cct caa cct cct gtc aat gct ggc ggc ggc tct ggt
Domain 2 > Linker 2 

241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 
GGSGGGSEGGGSEGG 
2299 ggt ggt tct ggt ggc ggc tct gag ggt ggt ggc tct gag ggt ggc 
Linker 2 

256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 
GSEGGGSEGGGSGGG 
2344 ggt tct gag ggt ggc ggc tct gag gga ggc ggt tcc ggt ggt ggc
Linker 2 

271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 
SGSGDFDYEKMANAN 
2389 tct ggt tcc ggt gat ttt gat tat gaa aag atg gca aac gct aat
Linker 2> Domain 3 

286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 
KGAMTENADENALQS 
2434 aag ggg gct atg acc gaa aat gcc gat gaa aac gcg cta cag tct
Domain 3 

301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 
DAKGKLDSVATDYGA
2479 gac gct aaa ggc aaa ctt gat tct gtc gct act gat tac ggt gct
Domain 3 

316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 
AIDGFIGDVSGLANG 
2524 gct atc gat ggt ttc att ggt gac gtt tcc ggc ctt gct aat ggt
Domain 3 

331 332 333 334 335 336 337 338 339 340 341 342 343 344 345
NGATGDFAGSNSQMA
2569 aat ggt gct act ggt gat ttt gct ggc tct aat tcc caa atg gct
Domain 3

346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 
QVGDGDNSPLMNNFR 
2614 caa gtc ggt gac ggt gat aat tca cct tta atg aat aat ttc cgt
Domain 3 

361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 
QYLPSLPQSVECRPF 
2659 caa tat tta cct tcc ctc cct caa tcg gtt gaa tgt cgc cct ttt
Domain 3 

376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 
VFSAGKPYEFSIDCD
2704 gtc ttt agc gct ggt aaa cca tat gaa ttt tct att gat tgt gac
Domain 3 

391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 
KINLFRGVFAFLLYV 
2749 aaa ata aac tta ttc cgt ggt gtc ttt gcg ttt ctt tta tat gtt 
Domain 3 > Transmembrane segment 

406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 
ATFMYVFSTFANILR 
2794 gcc acc ttt atg tat gta ttt tct acg ttt gct aac ata ctg cgt
Transmembrane segment > ICA- 

421 422 423 424 425 
N K E S 
2839 aat aag gag tct taa ! 2853 

ICA > ICA = intracellular anchor 
Table 38: Whole mature III anchor M13-III
derived anchor with recoded DNA 

! 

! 1 2 3
! A A A
1 GCG gcc gca
! NotI
!
! 4 5 6 7 8 9 10 11 12 13 14 15 16 17
! HHHHHHGAAEQKLI
10 cat cat cat cac cat cac ggg gcc gca gaa caa aaa ctc atc
!

! 18 19 20 21 22 23 24 25 26 27 28 29 

! SEEDLNGAA.AS 

52 tca gaa gag gat ctg aat ggg gcc gca Tag GCT AGC
! NheI...
!

! 30 31 32 33 34 35 36 37 38 39 

! DINDDRMAST
88 GAT ATC aac gat gat cgt atg gct tct act
! (ON_G37bot) [RC] 5'-c aac gat gat cgt atg gcG CAt GCt gcc gag aca g-3'
! EcoRV..
! Enterokinase cleavage site.
!

! Start mature III (recoded) Domain 1 > 

! 40 41 42 43 

! A E T V 

118 | gcC | gaG | acA | gtC |
! t a t t ! W.T.
!

! 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 

! ESCLAKPHTENSFTN 

130 | gaa | TCC | tgC | CTG | GCC | AaG | ccT | caC | acT | gaG | aat | AGT | ttC | aCA | Aat |
! agt tta a a c t a a tca t t c

! W.T. 

! MscI 

i 

! 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 

! VWKDDKTLDRYANYE 

175 | gtg | TGG | aaG | gaT | gaT | aaG | acC | CtT | gAT | CGA | TaT | gcC | aaT | taC | gaA |
! c accatta tctctg ! W.T.
! BspDI...

j 

! 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 

! GCLWNATGVVVCTGD 

220 | ggC | tgC | TtA | tgg | aat | gcC | ACC | GGC | GtC | gtT | gtC | TGC | ACG | ggC | gaT |
! ttcg ta tattttc ! W.T.
! SgrAI BsgI....

j 

! 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 

! ETQCYGTWVPIGLAI 

265 | gaG | acA | caA | tgC | taT | ggC | ACG | TGg | gtG | ccG | atA | gGC | TTA | GCC | atA |
! atgtcta tttgcttc ! W.T.
! PmlI BlpI

Domain 1 > Linker 1 > 

104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 
PENEGGGSEGGGSEG 
310 | ccG | gaG | aaC | gaA | ggC | ggC | ggT | AGC | gaA | ggC | ggT | ggC | AGC | gaA | ggC |
! t a t g t t c tct g t c t tct g t ! W.T.



20 W.T. 



Linker 1 > Domain 2 > 

119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 
GGSEGGGTKPPEYGD 
355 | ggT | GGA | TCC | gaA | ggA | ggT | ggA | acC | aaG | ccG | ccG | gaA | taT | ggC | gaC |
cttgtcttattgctt 

BamHI . . (2/2) 

134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 
TPIPGYTYINPLDGT 
400 | acT | ccG | atA | CCT | GGT | taC | acC | taC | atT | aaT | ccG | TtA | gaT | ggA | acC |
attgctttcctcccct 



SexAI .... 

149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 
YPPGTEQNPANPNPS 
445 | taC | ccT | ccG | ggC | acC | gaA | caG | aaT | ccT | gcC | aaC | ccG | aaC | ccA | AGC |
TGtttgaccttttt tct ! 



W.T. 



HindIII.



164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 
LEESQPLNTFMFQNN 
490 | TTA | gaA | gaA | AGC | caA | ccG | TtA | aaC | acC | ttT | atg | ttC | caA | aaC | aaC |
! ctGGtctgtctttc tgtt ! W.T.
HindIII.

179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 
RFRNRQGALTVYTGT 
535 | CgT | ttT | AgG | aaC | CgT | caA | gGT | GCT | CtT | acC | gTG | TAC | AcT | ggA | acC |
! ag cca tag g g ata t t t g c t 

W.T. 

HgiAI... BsrGI... 

194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 
VTQGTDPVKTYYQYT 
580 | gtC | acC | caG | GGT | ACC | gaT | ccT | gtC | aaG | acC | taC | taT | caA | taT | acC |
ttactcctattcgct 



W.T. 
KpnI...

209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 

PVSSKAMYDAYWNGK 
625 | ccG | gtC | TCG | AGt | aaG | gcT | atg | taC | gaT | gcC | taT | tgg | aaT | ggC | aaG |
! t a a tca ac tctc cta ! W.T.
BsaI.
XhoI.



224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 
FRDCAFHSGFNEDPF 
670 | ttT | CgT | gaT | tgT | gcC | ttT | caC | AGC | ggT | ttC | aaC | gaa | gac | CCt | ttT |
! CAa C c t c t tct



W.T. 



239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 
VCEYQGQSSDLPQPP 
715 | gtC | tgC | gaG | taC | caG | ggT | caG | AGT | AGC | gaT | TtA | ccG | caG | ccA | CCG |
! t t a t a c a tcg tct c c g t a t t ! W.T.
! DrdI.
AgeI...



Domain 2 > Linker 2 > 

254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 
VNAGGGSGGGSGGGS 
760 | GTT | AAC | gcG | ggT | ggT | ggT | AGC | ggC | ggA | ggC | AGC | ggC | ggT | ggT | AGC |
! c t t c c c tct t t t tct tcc tct ! W.T.
AgeI
HpaI...
HincII.

Linker 2 > 

Domain 3 — > 

! 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283
! EGGGSEGGGSGGGSG
805 | gaA | ggC | ggA | ggT | AGC | gaA | ggA | ggT | ggC | AGC | ggA | ggC | ggT | AGC | ggC |
g t t c tct g t c t tct g t c tct t 

W.T. 

Domain 3 > 

284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 
SGDFDYEKMANANKG 
850 | AGT | ggC | gac | ttc | gac | tac | gag | aaa | atg | gct | aat | gcc | aac | aaa | GGC |
! tcc tttttag acttgg !

W.T. 
i 

KasI 

299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 
AMTENADENALQSDA
895 | GCC | atg | act | gag | aac | gct | gac | gaG | AAT | GCA | ctg | caa | agt | gat | gCC |
! t catctacgag tct c t ! W.T.
! KasI.... BsmI....
StyI...

314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 
KGKLDSVATDYGAAI 
940 | AAG | GGt | aag | tta | gac | agc | gTC | GCc | Aca | gac | tat | ggT | GCt | gcc | atc |
! a c a c t t tct t t t c t ! W.T.
StyI PflFI

329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 
DGFIGDVSGLANGNG 
985 | gac | ggc | ttt | atc | ggc | gat | gtc | agt | ggt | ctg | gct | aac | ggc | aac | gga |
! t t c t t c t tcc c c t t t t t

W.T. 
! 344 345 346 347 348 349 350 351 352 353
! ATGDFAGSNS
1030 | gcc | acc | gga | gac | ttc | GCA | GGT | tcG | AAT | TCt |
! ttttttct c ! W.T.
BstBI...
EcoRI...
BspMI..

354 355 356 357 358 359 360 361 362 363 
QMAQVGDGDN 
1060 cag atg gcC CAG GTT GGA GAT GGg gac aac
! a tactcttt ! W.T.
XcmI

364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379
! SPLMNNFRQYLPSLPQ
1090 agt ccg ctt atg aac aac ttt aga cag tac ctt ccg tct ctt ccg cag
! tca tta t t cct a tta t c c t a ! W.T.
!

! 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395
! SVECRPFVFSAGKPYE
1138 agt gtc gag tgc cgt cca ttc gtt ttc tct gcc ggc aag cct tac gag
! tcg tatcttct agc t t a a t a ! W.T.

Domain 3 > 

396 397 398 399 400 401 402 403 404 405 406 407 
FSIDCDKINLFR 
1186 ttc aGC Atc gac TGC gat aag atc aat ctt ttC CGC
t tct tttcaacta t 
BstAPI SacII... 

transmembrane segment > 

408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423
! GVFAFLLYVATFMYVF
1222 GGc gtt ttc gct ttc ttg cta tac gtc gct act ttc atg tac gtt ttc
! tctgtcttattcct ta t ! W.T.

424 425 426 427 428 429 430 431 432 433 434 435 
STFANILRNKES
1270 aGC ACT TTC GCC AAT ATT TTA Cgc aac aaa gaa agc

tct gttcacg ttgg tct ! W.T. 

Intracellular anchor. 



1306 tag tga tct CCT AGG 

AvrII..

1321 aag ccc gcc taa tga gcg ggc ttt ttt ttt ct ggt
| Trp terminator |

End Fab cassette 

End of Table 
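
The restriction-site annotations in Table 38 can be checked by searching the sequence for each enzyme's recognition sequence. The sketch below is illustrative only and is not part of the original disclosure. It joins the first four rows of the table (positions 1 to 117) and looks up NotI (GCGGCCGC), NheI (GCTAGC) and EcoRV (GATATC). The function name `find_sites` and the variable names are hypothetical.

```python
# Recognition sequences of the three enzymes annotated on rows 1-88 of Table 38.
RECOGNITION = {
    "NotI": "GCGGCCGC",
    "NheI": "GCTAGC",
    "EcoRV": "GATATC",
}

# Rows starting at positions 1, 10, 52 and 88 of Table 38, spaces removed below.
ROWS = [
    "GCG gcc gca",
    "cat cat cat cac cat cac ggg gcc gca gaa caa aaa ctc atc",
    "tca gaa gag gat ctg aat ggg gcc gca Tag GCT AGC",
    "GAT ATC aac gat gat cgt atg gct tct act",
]
SEQUENCE = "".join("".join(ROWS).split()).upper()


def find_sites(sequence: str, site: str) -> list[int]:
    """Return the 1-based start position of every occurrence of `site`."""
    positions = []
    start = sequence.find(site)
    while start != -1:
        positions.append(start + 1)
        start = sequence.find(site, start + 1)
    return positions


for enzyme, site in RECOGNITION.items():
    print(enzyme, find_sites(SEQUENCE, site))
```

Each enzyme should be found exactly once, at the position where the table places its label: NotI at 1, NheI at 82 and EcoRV at 88.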
Table 39: ONs to make deletions in III

! ONs for use with NheI
!
(ON_G29bot) 5'-c gTT gAT ATc gcT Agc cTA Tgc-3' ! 22
! this is the reverse complement of 5'-gca tag gct agc gat atc aac g-3'
! NheI... scab

(ON_G104top) 5'-g|ata|ggc|tta|gcT|aGC|ccg|gag|aac|gaa|gg-3' ! 30
! Scab NheI... 104 105 106 107 108

(ON_G236top) 5'-c|ttt|cac|agc|ggt|ttc|GCT|AGC|gac|cct|ttt|gtc|tgc-3' ! 37
! NheI... 236 237 238 239 240

(ON_G236tCS) 5'-c|ttt|cac|agc|ggt|ttc|GCT|AGC|gac|cct|ttt|gtc|Agc-
! NheI... 236 237 238 239 240
gag|tac|cag|ggt|c-3' ! 50



! ONs for use with SphI G CAT Gc

(ON_X37bot) 5'-gAc TgT cTc ggc Agc ATg cgc cAT Acg ATc ATc gTT g-3' ! 37
! NDDRMAHA
! (ON_X37bot) = [RC] 5'-c aac gat gat cgt atg gcG CAt GCt gcc gag aca gtc-3'
! SphI.... Scab

(ON_X104top) 5'-g|gtG|ccg|ata|ggc|ttG|CAT|GCa|ccg|gag|aac|gaa|gg-3' ! 36
! Scab SphI 104 105 106 107 108

(ON_X236top) 5'-c|ttt|cac|agc|ggt|ttG|CaT|gCa|gac|cct|ttt|gtc|tgc-3' ! 37
! SphI 236 237 238 239 240

(ON_X236tCS) 5'-c|ttt|cac|agc|ggt|ttG|CaT|gCa|gac|cct|ttt|gtc|Agc-
! NheI... 236 237 238 239 240
gag|tac|cag|ggt|c-3' ! 50
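
Table 39 gives two of the "bot" oligonucleotides together with their reverse complements. That relationship can be confirmed with a short script. This is a minimal sketch, not part of the original disclosure, and the function name `reverse_complement` is hypothetical. Spaces and letter case in the printed sequences are ignored for the comparison.

```python
# Complement map for both lower-case and upper-case bases.
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string; spaces, dashes and primes are dropped."""
    bases = "".join(ch for ch in seq if ch.isalpha())
    return bases.translate(COMPLEMENT)[::-1]


def same_sequence(a: str, b: str) -> bool:
    """Compare two sequences, ignoring spacing and letter case."""
    clean = lambda s: "".join(ch for ch in s if ch.isalpha()).lower()
    return clean(a) == clean(b)


# ON_G29bot and the top-strand sequence it is stated to complement.
on_g29bot = "c gTT gAT ATc gcT Agc cTA Tgc"
g29_top = "gca tag gct agc gat atc aac g"

# ON_X37bot and its stated reverse complement.
on_x37bot = "gAc TgT cTc ggc Agc ATg cgc cAT Acg ATc ATc gTT g"
x37_top = "c aac gat gat cgt atg gcG CAt GCt gcc gag aca gtc"

print(same_sequence(reverse_complement(g29_top), on_g29bot))  # True
print(same_sequence(reverse_complement(x37_top), on_x37bot))  # True
```

The lengths of the cleaned sequences (22 and 37 bases) also agree with the length annotations printed after each oligonucleotide.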
Table 40: Phage titers and enrichments of selections with a DY3F31-based human Fab library

                                    Input (total cfu)   Output (total cfu)   Output/input ratio
R1-ox selected on phOx-BSA          4.5 x 10^12         3.4 x 10^5           7.5 x 10^-8
R2-Strep selected on Strep-beads    9.2 x 10^12         3 x 10^8             3.3 x 10^-5
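
The output/input ratio in Table 40 is the output titer divided by the input titer. The sketch below recomputes it from the two titer columns. It is illustrative only and not part of the original disclosure; the dictionary layout and the function name `output_input_ratio` are assumptions.

```python
# (input cfu, output cfu) for each selection, taken from Table 40.
SELECTIONS = {
    "R1-ox on phOx-BSA": (4.5e12, 3.4e5),
    "R2-Strep on Strep-beads": (9.2e12, 3e8),
}


def output_input_ratio(input_cfu: float, output_cfu: float) -> float:
    """Fraction of input phage recovered after one round of selection."""
    return output_cfu / input_cfu


for name, (input_cfu, output_cfu) in SELECTIONS.items():
    ratio = output_input_ratio(input_cfu, output_cfu)
    print(f"{name}: {ratio:.1e}")
```

Both recomputed ratios agree with the tabulated values of 7.5 x 10^-8 and 3.3 x 10^-5 to within rounding.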
Table 41: Frequency of ELISA positives in DY3F31-based Fab libraries

                               Anti-M13 HRP   9E10/RAM-HRP   Anti-CK/CL Gar-HRP
R2-ox (with IPTG induction)    18/44          10/44          10/44
R2-ox (without IPTG)           13/44          ND             ND
R3-strep (with IPTG)           39/44          38/44          36/44
R3-strep (without IPTG)        33/44          ND             ND
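
The counts in Table 41 are easier to compare as percentages of the 44 clones tested per condition. The sketch below converts them; it is illustrative only and not part of the original disclosure. The names `COUNTS` and `positive_rate` are assumptions, and ND entries are stored as `None`.

```python
CLONES_TESTED = 44

# Positive clones per detection reagent, in the column order of Table 41:
# anti-M13 HRP, 9E10/RAM-HRP, anti-CK/CL Gar-HRP. None stands for ND.
COUNTS = {
    "R2-ox (with IPTG induction)": (18, 10, 10),
    "R2-ox (without IPTG)": (13, None, None),
    "R3-strep (with IPTG)": (39, 38, 36),
    "R3-strep (without IPTG)": (33, None, None),
}


def positive_rate(hits, total=CLONES_TESTED):
    """Percentage of positive clones, or None when the assay was not done."""
    return None if hits is None else 100.0 * hits / total


for condition, hits in COUNTS.items():
    rates = [positive_rate(h) for h in hits]
    text = ", ".join("ND" if r is None else f"{r:.1f}%" for r in rates)
    print(f"{condition}: {text}")
```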



